
I have a fine-tuned VGG model that I created with the tensorflow.keras functional API and saved using tf.contrib.saved_model.save_keras_model. The model is saved with this structure: an assets folder (which contains the saved_model.json file), a saved_model.pb file, and a variables folder, which contains checkpoint, variables.data-00000-of-00001, and variables.index.

I can easily load my model in Python and get predictions using tf.contrib.saved_model.load_keras_model(saved_model_path), but I have no idea how to load the model in Java. I googled a lot and found this How to export Keras .h5 to tensorflow .pb? for exporting a .pb file, and then tried loading it following this link: Loading in Java. I was not able to freeze the graph. I also tried simple_save, but tensorflow.keras does not support it (AttributeError: module 'tensorflow.contrib.saved_model' has no attribute 'simple_save'). Can someone help me figure out what steps are needed to load my model (a tensorflow.keras functional API model) in Java?

Is the saved_model.pb file that I have good enough to be loaded on the Java side? Do I need to create my own input/output placeholders? If so, how can I export them?
I appreciate your help.

  • You can use TensorFlow Lite instead: tensorflow.org/lite – Commented Dec 12, 2018 at 15:49

1 Answer


If you have a model saved in the SavedModel format (which it appears you do, and things like tf.contrib.saved_model.save_keras_model can help create), then in Java you can use SavedModelBundle.load to load and serve it. You do not need to "freeze" the model.

You may find the following useful:

But the basic idea is that your code will look something like:

try (SavedModelBundle model = SavedModelBundle.load("<directory>", "serve")) {
  try (Tensor<?> input = makeInputTensor();
       Tensor<?> output = model.session().runner()
                               .feed("INPUT_TENSOR", input)
                               .fetch("OUTPUT_TENSOR")
                               .run()
                               .get(0)) {
    // Use output
  }
}

Where "INPUT_TENSOR" and "OUTPUT_TENSOR" are the names of the input and output tensors in the TensorFlow graph. The saved_model_cli command-line tool, installed when you install TensorFlow for Python, can show you these names: for example, saved_model_cli show --dir <directory> --all lists the inputs and outputs of each signature in your model.
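As for makeInputTensor(): it is a placeholder for code you write yourself to build the input Tensor. A minimal sketch follows; the [1, 224, 224, 3] VGG-style shape and the 0–1 pixel scaling are assumptions here, so check your model's actual input shape and preprocessing with saved_model_cli:

```java
// Sketch: pack interleaved RGB bytes into the nested float array that
// the TensorFlow Java API's Tensors.create(...) accepts for a 4-D tensor.
// Assumed shape: [1, 224, 224, 3] (batch, height, width, RGB channels).
public class InputTensorSketch {
    static final int H = 224, W = 224, C = 3;

    // Convert raw RGB bytes (0..255) to a normalized [1][H][W][C] float array.
    static float[][][][] toBatch(byte[] rgb) {
        float[][][][] batch = new float[1][H][W][C];
        int i = 0;
        for (int y = 0; y < H; y++)
            for (int x = 0; x < W; x++)
                for (int c = 0; c < C; c++)
                    batch[0][y][x][c] = (rgb[i++] & 0xFF) / 255.0f; // scale to [0,1]
        return batch;
    }

    public static void main(String[] args) {
        byte[] rgb = new byte[H * W * C];
        rgb[0] = (byte) 255;                     // first red channel fully on
        float[][][][] batch = toBatch(rgb);
        System.out.println(batch[0][0][0][0]);   // prints 1.0
        System.out.println(batch[0].length + "x" + batch[0][0].length
                           + "x" + batch[0][0][0].length);
        // With the TensorFlow Java API on the classpath, wrap it like:
        // Tensor<Float> input = org.tensorflow.Tensors.create(batch);
    }
}
```

Whether you scale to [0,1], subtract the ImageNet channel means, or do something else entirely depends on how the model was trained, so mirror whatever preprocessing your Python code applies before prediction.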

Note that the TensorFlow Java API may be better suited to server/desktop applications than TensorFlow Lite, which another commenter suggested. The TensorFlow Lite runtime, while optimized (in memory footprint etc.) for small devices, cannot run all models yet, whereas the TensorFlow Java API uses the exact same runtime as TensorFlow for Python and thus has the exact same abilities.

Hope that helps.


5 Comments

I did exactly the same, and it finally worked fine when I was using the Inception model as the pre-trained model. But when I use the VGG model as a base, my model cannot be loaded in Java. Have you seen any tutorial that loads the VGG model, fine-tunes it, and then loads it in Java? I will send you the errors that I get when I try the VGG model. I do appreciate your help.
This is the error that I get: Matrix size-incompatible: In[0]: [1,8192], In[1]: [25088,256] [[{{node dense/MatMul}} = MatMul[T=DT_FLOAT, _output_shapes=[[?,256]], transpose_a=false, transpose_b=false, _device="/job:localhost/replica:0/task:0/device:CPU:0"](flatten/Reshape, dense/MatMul/ReadVariableOp)]]
The Example Link in the response is not valid anymore.
what is makeInputTensor()? Something the user would code that returns a Tensor of the appropriate dimension?
And on the next line, output is used before it is defined, seemingly.
