Tensor associated to a database
In this post (the 2nd in this series on Tensors and Language Models), we will be interested in collections of tables or dataframes that look something like:
We will call these databases. This example database comes from the paper on quantifying the knowledge capacity of attention layers.
By the end of this post, we will see how to define a tensor associated to a database. But first, we will begin by defining a matrix associated to a function $f: X \to Y$.