40

I am new to TensorFlow. While reading the existing documentation, I found the term tensor really confusing. Because of this, I need to clarify the following questions:

  1. What is the relationship between a tensor and a Variable? What about
    tensor vs. tf.constant, and tensor vs. tf.placeholder?
  2. Are they all types of tensors?
ZijunLost
  • I think understanding what **data** and **operations** are in TensorFlow would be helpful for this question. – zhy Aug 01 '17 at 16:05

5 Answers

69

TensorFlow doesn't have first-class Tensor objects, meaning that there is no notion of a Tensor in the underlying graph that's executed by the runtime. Instead the graph consists of op nodes connected to each other, representing operations. An operation allocates memory for its outputs, which are available on endpoints :0, :1, etc., and you can think of each of these endpoints as a Tensor. If you have a tensor corresponding to nodename:0 you can fetch its value as sess.run(tensor) or sess.run('nodename:0'). Execution granularity happens at the operation level, so the run method will execute the op, which will compute all of the endpoints, not just the :0 endpoint. It's possible to have an op node with no outputs (like tf.group), in which case there are no tensors associated with it. It is not possible to have tensors without an underlying op node.
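
As a minimal sketch of that last point (assuming the TF 1.x graph-mode API used throughout this answer), you can compare an op that has output endpoints with tf.group, which has none:

import tensorflow as tf

tf.reset_default_graph()
a = tf.constant(1.0)
b = tf.constant(2.0)
grouped = tf.group(a, b)   # an Operation, not a Tensor
print(grouped.outputs)     # [] -- no endpoints, nothing to fetch
print(a.op.outputs)        # [<tf.Tensor 'Const:0' shape=() dtype=float32>]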

You can examine what happens in the underlying graph by doing something like this:

tf.reset_default_graph()
value = tf.constant(1)
print(tf.get_default_graph().as_graph_def())

So with tf.constant you get a single operation node, and you can fetch it using sess.run("Const:0") or sess.run(value).
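
For completeness, here's a runnable version of that fetch (a sketch, assuming TF 1.x):

import tensorflow as tf

tf.reset_default_graph()
value = tf.constant(1)
with tf.Session() as sess:
    print(sess.run(value))       # fetch by Tensor object -> 1
    print(sess.run("Const:0"))   # fetch by endpoint name -> 1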

Similarly, value = tf.placeholder(tf.int32) creates a regular node with the name Placeholder, and you could feed it as feed_dict={"Placeholder:0": 2} or feed_dict={value: 2}. You cannot feed and fetch the same placeholder in a single session.run call, but you can see the result by attaching a tf.identity node on top and fetching that.
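
A sketch of that identity trick (TF 1.x; the node names Placeholder and Identity assume a fresh graph):

import tensorflow as tf

tf.reset_default_graph()
value = tf.placeholder(tf.int32)
result = tf.identity(value)      # extra node so the fed value can be fetched
with tf.Session() as sess:
    print(sess.run(result, feed_dict={value: 2}))                   # 2
    print(sess.run("Identity:0", feed_dict={"Placeholder:0": 2}))   # 2, by name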

For a variable:

tf.reset_default_graph()
value = tf.Variable(tf.ones_initializer()(()))
value2 = value+3
print(tf.get_default_graph().as_graph_def())

You'll see that it creates two nodes, Variable and Variable/read; the :0 endpoint is a valid value to fetch on both of these nodes. However, Variable:0 has a special ref type, meaning it can be used as an input to mutating operations. The result of the Python call tf.Variable is a Python Variable object, and there's some Python magic to substitute Variable/read:0 or Variable:0 depending on whether mutation is necessary. Since most ops have only one endpoint, :0 is dropped. Another example is Queue: the close() method will create a new Close op node which connects to the Queue op. To summarize: operations on Python objects like Variable and Queue map to different underlying TensorFlow op nodes depending on usage.
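
A small sketch of the ref/read distinction (TF 1.x; the node names Variable and Variable/read assume a fresh graph):

import tensorflow as tf

tf.reset_default_graph()
value = tf.Variable(tf.ones_initializer()(()))
assign_op = value.assign(5.0)    # mutating op, wired to the ref endpoint Variable:0
with tf.Session() as sess:
    sess.run(tf.global_variables_initializer())
    print(sess.run("Variable/read:0"))   # 1.0 -- the dereferenced value
    sess.run(assign_op)
    print(sess.run(value))               # 5.0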

For ops like tf.split or tf.nn.top_k, which create nodes with multiple endpoints, Python's session.run call automatically wraps the output in a tuple or collections.namedtuple of Tensor objects, which can be fetched individually.
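
For example (a sketch, TF 1.x), tf.nn.top_k is one op node with two endpoints, which session.run wraps in a namedtuple:

import tensorflow as tf

tf.reset_default_graph()
values = tf.constant([1, 3, 2])
top = tf.nn.top_k(values, k=2)    # one TopKV2 node, endpoints :0 and :1
with tf.Session() as sess:
    print(sess.run(top))          # TopKV2(values=array([3, 2]), indices=array([1, 2]))
    print(sess.run(top.values))   # fetch only the values endpoint
    print(sess.run(top.indices))  # fetch only the indices endpoint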

Yaroslav Bulatov
15

From the glossary:

A Tensor is a typed multi-dimensional array. For example, a 4-D array of floating point numbers representing a mini-batch of images with dimensions [batch, height, width, channel].

Basically, all data in TensorFlow is a Tensor (hence the name):

  • placeholders are Tensors to which you can feed a value (with the feed_dict argument in sess.run())
  • Variables are Tensors which you can update (with var.assign()). Technically speaking, tf.Variable is not a subclass of tf.Tensor though
  • tf.constant is just the most basic Tensor, which contains a fixed value given when you create it

However, in the graph, every node is an operation, which can have Tensors as inputs or outputs.
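
To illustrate all three at once, here is a minimal sketch using the TF 1.x session API this question is about:

import tensorflow as tf

c = tf.constant(3.0)              # fixed value
p = tf.placeholder(tf.float32)    # value fed at run time
v = tf.Variable(1.0)              # value you can update
total = c + p + v                 # all three are consumed as Tensors by the add ops
with tf.Session() as sess:
    sess.run(tf.global_variables_initializer())
    print(sess.run(total, feed_dict={p: 2.0}))   # 6.0
    sess.run(v.assign(10.0))                     # update the variable
    print(sess.run(total, feed_dict={p: 2.0}))   # 15.0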

Olivier Moindrot
  • 4
    I don't agree. If you look at section 2 of tensorflow white paper:"In a TensorFlow graph, each node has zero or more inputs and zero or more outputs, and represents the instantiation of an operation". Variables, constant and placeholders are nodes, aka, instantiation of OPERATIONS just like tf.mul or tf.add . I think they produce tensors as output, but they themselves are not tensors. – ZijunLost Jun 16 '16 at 13:28
  • Well yes, the graph is composed of operations, which pass Tensors between themselves. I will update my answer to explain the operations linked to `tf.constant` and `tf.placeholder` – Olivier Moindrot Jun 16 '16 at 13:39
  • 2
    thanks, but I still believe it is better to call placeholders or constants as 'operations' that produce tensors instead of saying "placeholders are tensors". If you read the doc: "TensorFlow provides a placeholder operation that must be fed with data on execution. For more info, see the section on Feeding data." – ZijunLost Jun 16 '16 at 14:21
12

As already mentioned by others, yes they are all tensors.

The way I understood them was to first visualize and understand 1D, 2D, 3D, 4D, 5D, and 6D tensors as in the picture below (source: knoldus).

[figure: visualization of 1D through 6D tensors]

Now, in the context of TensorFlow, you can imagine a computation graph like the one below,

[figure: example computation graph]

Here, the ops take two tensors a and b as input, multiply each tensor by itself, and then add the results of these multiplications to produce the result tensor t3. These multiplications and the addition happen at the nodes in the computation graph.

And these tensors a and b can be constant tensors, Variable tensors, or placeholders. It doesn't matter, as long as they are of the same data type and have compatible (or broadcastable) shapes for the operations involved.
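
A sketch of that graph in code (TF 1.x; the names a, b, and t3 mirror the figure):

import tensorflow as tf

a = tf.constant(2.0)    # could just as well be a Variable or a placeholder
b = tf.constant(3.0)
t1 = a * a              # multiplication op node
t2 = b * b              # multiplication op node
t3 = t1 + t2            # addition op node producing the result tensor
with tf.Session() as sess:
    print(sess.run(t3)) # 13.0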

kmario23
1

Data is stored in matrices. A 28x28 pixel grayscale image fits into a 28x28 two-dimensional matrix. But for a color image, we need more dimensions. There are 3 color values per pixel (Red, Green, Blue), so a three-dimensional table will be needed with dimensions [28, 28, 3]. And to store a batch of 128 color images, a four-dimensional table is needed with dimensions [128, 28, 28, 3].

These multi-dimensional tables are called "tensors" and the list of their dimensions is their "shape".

Source
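
As a quick sketch of those shapes (using tf.zeros purely for illustration):

import tensorflow as tf

gray  = tf.zeros([28, 28])            # one grayscale image
color = tf.zeros([28, 28, 3])         # one RGB image, 3 values per pixel
batch = tf.zeros([128, 28, 28, 3])    # a batch of 128 RGB images
print(gray.shape, color.shape, batch.shape)   # (28, 28) (28, 28, 3) (128, 28, 28, 3)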

Uttkarsh Patel
-1

TensorFlow's central data type is the tensor. Tensors are the underlying components of computation and a fundamental data structure in TensorFlow. Without using complex mathematical interpretations, we can say a tensor (in TensorFlow) describes a multidimensional numerical array: a zero- or n-dimensional collection of data, determined by rank, shape, and type. Read more: What is tensors in TensorFlow?
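
A short sketch of inspecting those three properties (dtype, shape, and rank are standard tf.Tensor attributes):

import tensorflow as tf

t = tf.constant([[1.0, 2.0], [3.0, 4.0]])
print(t.dtype)        # type:  float32
print(t.shape)        # shape: (2, 2)
print(len(t.shape))   # rank:  2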