1.3 Node and Edge Features
The nodes and edges of a DGLGraph can have several user-defined named features for storing graph-specific properties of the nodes and edges. These features can be accessed via the ndata and edata interfaces. For example, the following code creates two node features (named 'x' and 'y', in lines 8 and 15) and one edge feature (named 'x', in line 9).
Important facts about the ndata/edata interface:
Only features of numerical types (e.g., float, double, and int) are allowed. They can be scalars, vectors or multi-dimensional tensors.
Each node feature has a unique name, and each edge feature has a unique name. A node feature and an edge feature can share the same name (e.g., 'x' in the above example).
A feature is created via tensor assignment, which assigns a feature to each node/edge in the graph. The leading dimension of that tensor must be equal to the number of nodes/edges in the graph. You cannot assign a feature to a subset of the nodes/edges in the graph.
Features of the same name must have the same dimensionality and data type.
The feature tensor is in row-major layout – each row-slice stores the feature of one node or edge (e.g., see lines 16 and 18 in the above example).
For weighted graphs, one can store the weights as an edge feature as below.
>>> import dgl
>>> import torch as th
>>> # edges 0->1, 0->2, 0->3, 1->3
>>> edges = th.tensor([0, 0, 0, 1]), th.tensor([1, 2, 3, 3])
>>> weights = th.tensor([0.1, 0.6, 0.9, 0.7])  # weight of each edge
>>> g = dgl.graph(edges)
>>> g.edata['w'] = weights  # give it a name 'w'
>>> g
Graph(num_nodes=4, num_edges=4,
      ndata_schemes={}
      edata_schemes={'w' : Scheme(shape=(,), dtype=torch.float32)})