Keras variable() memory leak
I am new to Keras, and to TensorFlow in general, and have a problem. I am using some of the loss functions (mainly binary_crossentropy and mean_squared_error) to calculate the loss after prediction. Since Keras only accepts its own variable type, I create Keras variables and supply them as arguments. This scenario is executed in a loop (with a sleep), as such:
Get appropriate data -> predict -> calculate the loss -> return it.
Since I have multiple models that follow this pattern, I created separate TensorFlow graphs and sessions to prevent collisions (also, when exporting the models' weights I had problems with a single graph and session, so I had to create a distinct one for every model).
However, the memory now rises uncontrollably, from a couple of MiB to 700 MiB within a couple of iterations. I am aware of Keras's clear_session() and gc.collect(), and I use them at the end of every iteration, but the problem persists. The snippet below is not the actual project code; I created a separate script to isolate the problem:
import tensorflow as tf
from keras import backend as K
from keras.losses import binary_crossentropy, mean_squared_error
from time import time, sleep
import gc
from numpy.random import rand
from os import getpid
from psutil import Process
from csv import DictWriter

this_process = Process(getpid())

graph = tf.Graph()
sess = tf.Session(graph=graph)

cnt = 0
max_c = 500

with open('/home/quark/Desktop/python-test/leak-7.csv', 'a') as file:
    writer = DictWriter(file, fieldnames=['time', 'mem'])
    writer.writeheader()

    while cnt < max_c:
        with graph.as_default(), sess.as_default():
            y_true = K.variable(rand(36, 6))
            y_pred = K.variable(rand(36, 6))
            rec_loss = K.eval(binary_crossentropy(y_true, y_pred))
            val_loss = K.eval(mean_squared_error(y_true, y_pred))

        writer.writerow({
            'time': int(time()),
            'mem': this_process.memory_info().rss
        })

        K.clear_session()
        gc.collect()

        cnt += 1
        print(max_c - cnt)
        sleep(0.1)
Additionally, here is a plot of the memory usage:
[plot: Keras memory leak]
Any help is appreciated.
python tensorflow memory-leaks keras
Could you add the required imports? I believe you are mixing tf and keras commands.
– Daniel GL
Nov 8 at 10:11
Yes, a full example that we can run would be nice.
– Matias Valdenegro
Nov 8 at 10:13
I've updated the code.
– mladenk
Nov 8 at 10:43
asked Nov 8 at 9:23 by mladenk; edited Nov 8 at 10:43
2 Answers
answered Nov 8 at 10:25 by Daniel GL:
I just removed the with statement (probably some tf code), and I don't see any leak. I believe there is a difference between the Keras session and the TF default session, so you were not clearing the correct session with K.clear_session(). Using tf.reset_default_graph() could probably work too.
while True:
    y_true = K.variable(rand(36, 6))
    y_pred = K.variable(rand(36, 6))
    val_loss = K.eval(binary_crossentropy(y_true, y_pred))
    rec_loss = K.eval(mean_squared_error(y_true, y_pred))
    K.clear_session()
    gc.collect()
    sleep(0.1)
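Since binary_crossentropy and mean_squared_error are simple element-wise formulas, another way to avoid building graph nodes on every iteration (my suggestion, not from the answer above) is to compute the losses in plain NumPy on the predicted arrays, with no Keras variables or sessions at all. A sketch, assuming y_true and y_pred hold probabilities in [0, 1]:

```python
import numpy as np

def np_mean_squared_error(y_true, y_pred):
    # Mean over the last axis, mirroring Keras' per-sample reduction
    return np.mean(np.square(y_true - y_pred), axis=-1)

def np_binary_crossentropy(y_true, y_pred, eps=1e-7):
    # Clip to avoid log(0); Keras clips similarly before taking the log
    y_pred = np.clip(y_pred, eps, 1.0 - eps)
    return np.mean(-(y_true * np.log(y_pred)
                     + (1.0 - y_true) * np.log(1.0 - y_pred)), axis=-1)

y_true = np.random.rand(36, 6)
y_pred = np.random.rand(36, 6)
rec_loss = np_binary_crossentropy(y_true, y_pred)  # one loss per sample, shape (36,)
val_loss = np_mean_squared_error(y_true, y_pred)   # one loss per sample, shape (36,)
```

No TensorFlow objects are created here, so there is nothing to leak and no session to clear.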
I know that removing the with statement solves the problem, but I am afraid I cannot omit it because of the different graphs I have. Regardless, thanks, and I'll try tf.reset_default_graph().
– mladenk
Nov 8 at 10:41
Then you just have to reset the appropriate graph, not just the default one.
– Daniel GL
Nov 8 at 10:48
answered Nov 9 at 11:18 by mladenk:
In the end, what I did was move the K.variable() calls out of the with statement. That way, the variables are part of the default graph, which is later cleared by K.clear_session().
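Independently of the fix, it can help to check whether the growth lives in Python objects or inside TensorFlow's native runtime (graph nodes are C++ allocations that gc.collect() never sees). The stdlib tracemalloc module tracks only Python-level allocations, so if RSS keeps climbing while tracemalloc reports nothing, the leak is on the native side. A minimal sketch of the technique (my addition, not from the thread), using a plain Python list to simulate a leak:

```python
import tracemalloc

tracemalloc.start()
before = tracemalloc.take_snapshot()

leaky = []
for _ in range(100):
    leaky.append([0] * 1000)  # stand-in for whatever your loop allocates

after = tracemalloc.take_snapshot()

# Allocation sites sorted by how much they grew between the two snapshots
stats = after.compare_to(before, 'lineno')
growth = sum(stat.size_diff for stat in stats)
```

If growth stays near zero between iterations while the process RSS keeps rising, the leaked memory belongs to the TF runtime (e.g. graph nodes) rather than to Python objects.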