Skip to content

Construct entity sense embeddings using DT and DeepWalk  #3

@alexanderpanchenko

Description

@alexanderpanchenko

Background

Data

  1. A Distributional Thesaurus (DT)
  1. Training datasets

Code

Steps

  1. Take the DT and compute coverage of the target entities from the https://docs.google.com/spreadsheets/d/1umTW0h8hGKqN1NSEpgds36qfhFZC4VO5dBjQ940dUY4/edit?usp=sharing. Report the coverage here.

  2. Build a graph from the DT and compute it’s graph embeddings using DeepWalk.

  • prune from the graph edges with very small (eg t < 0.001) scores
  • ALTERNATIVELY ADDITIONALLY build a graph of target entities and all related words
  1. Report here some nearest neighbors of some entities here like Michael Jordan.

  2. Create a disambiguated graph of senses using the provided code.

  3. Compute embeddings from the graph of senses like before using the DeepWalk. Report sense nearest neighbors.

Metadata

Metadata

Assignees

Labels

No labels
No labels

Type

No type
No fields configured for issues without a type.

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions