embedding question from filter text_feat.npy and image_feat

Thanks for your wonderful contribution for embedding netflix item data.

In python, when I load your Netflix data, the `text_feat.npy` and `image_feat.npy` both represents a numpy adarray. To be more exact:

```python
text_feat = np.load('text_feat.npy')
image_feat = np.load('image_feat.npy')

print(text_feat.shape) # -> 17366 * 768 
print(image_feat.shape) # -> 17366 * 512 
```

May I ask if it is true that the organization of text_feat and image_feat are by the sequence of, for each row,
item 1, [embedding 1];
item 2,[embedding 2]; # as itemid sequence
...
or
item 9733, [embedding 9733];
item 14147, [embedding 14147]; # as the sequence from `item_attribute.csv`
...

Thanks! I am carrying out embedding_based i2i similarity recommendation. 

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

embedding question from filter text_feat.npy and image_feat #17

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

embedding question from filter text_feat.npy and image_feat #17

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions