An Elasticsearch Plugin that Integrates NLP Capabilities into IR Scoring Algorithms
This plugin requires Java 8+
This plugin currently supports and requires Elasticsearch 7.3.0.
Due to fundamental changes in underlying lucene data structures between versions, it is currently unfeasible to support multiple versions of ES, although that may change in the future.
For the purposes of backreferencing, the supported Elasticsearch versions of each major release are listed below, in reverse chronological order, although features are not backported between versions
- v1.0.2 - Elasticsearch 7.3.0
- v1.0.1 - Elasticsearch 7.3.0
- v1.0.0 - Elasticsearch 6.6.0
- Please direct yourself to the releases page and download the
ES-NLP-PLUGIN.zipfor your desired release. - Follow the installation steps for installing elasticsearch plugins from filesystem here, directing it at the downloaded zip file
- Copy
config.ymlin the zip to<your_es_directory>/config/elasticsearch-nlp-plugin.ymland edit appropriately - Documents need to be indexed using the
nlptokenizer
Making customizations to this codebase is fairly straightforward - you will need JDK 8+, Apache Maven, and Apache Ant
- First, clone this repository and make your desired changes.
- Run
mvn clean installto generate the plugin JAR - Run
ant distto generate the distribution zip that is consumed by Elasticsearch's plugin installer - Follow the installation steps for installing elasticsearch plugins from filesystem here, directing it at the downloaded zip file
A. Wen, Y. Wang, V. C. Kaggal, S. Liu, H. Liu and J. Fan, "Enhancing Clinical Information Retrieval through Context-Aware Queries and Indices," 2019 IEEE International Conference on Big Data (Big Data), Los Angeles, CA, USA, 2019, pp. 2800-2807, doi: 10.1109/BigData47090.2019.9006241.