Natural language processing (NLP) is a set of tools used to derive meaningful and useful information from natural language sources such as web pages and text documents. Apache OpenNLP is an open-source Java library for processing natural language text. You can build an efficient text processing service using this library.
- Download the Apache OpenNLP distribution
- Extract the zip file
- Add `path-to/apache-opennlp-1.9.3/bin` to your environment variables
- Open this repo in your favorite IDE. I am using IntelliJ IDEA :)
- Add `opennlp-tools-1.9.3.jar` and `opennlp-uima-1.9.3.jar` to your project
Download the pre-trained models provided by Apache OpenNLP as follows:
- Open a terminal in the directory containing `OpenNLPModelsDownloader.java` and run it with a space-separated list of model names taken from the Apache OpenNLP website. You can download as many models as you want, for example:
java OpenNLPModelsDownloader.java da-sent.bin en-ner-person.bin
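For readers curious what such a downloader might do, here is a minimal sketch using only the Java standard library. It is not the code of `OpenNLPModelsDownloader.java` in this repo. The class name `ModelDownloadSketch`, the `models` target directory, and the base URL (assumed to be the OpenNLP 1.5-series model archive) are all assumptions made for illustration.

```java
import java.io.IOException;
import java.io.InputStream;
import java.net.URI;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.Paths;
import java.nio.file.StandardCopyOption;

// Hypothetical sketch, not the repo's OpenNLPModelsDownloader.
class ModelDownloadSketch {

    // Assumption: the pre-trained 1.5-series models are served from this base URL.
    static final String BASE_URL = "http://opennlp.sourceforge.net/models-1.5/";

    // Builds the download URI for one model file name, e.g. "en-ner-person.bin".
    static URI modelUri(String modelName) {
        if (modelName == null || !modelName.endsWith(".bin") || modelName.contains("/")) {
            throw new IllegalArgumentException(
                "Expected a model file name such as en-ner-person.bin, got: " + modelName);
        }
        return URI.create(BASE_URL + modelName);
    }

    // Downloads one model into targetDir and returns the path of the saved file.
    static Path download(String modelName, Path targetDir) throws IOException {
        Files.createDirectories(targetDir);
        Path target = targetDir.resolve(modelName);
        try (InputStream in = modelUri(modelName).toURL().openStream()) {
            Files.copy(in, target, StandardCopyOption.REPLACE_EXISTING);
        }
        return target;
    }

    public static void main(String[] args) throws IOException {
        Path modelsDir = Paths.get("models"); // hypothetical target directory
        // Each command-line argument is treated as one model file name.
        for (String name : args) {
            System.out.println("Downloaded " + download(name, modelsDir));
        }
    }
}
```

Keeping the URL construction in its own method makes it easy to check the model names before any network request is made.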
Run SentenceDetection or NamedEntityDetection to test these models.
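As a rough idea of what a sentence detection test can look like, the sketch below loads a sentence model and splits a short text with the OpenNLP `SentenceModel` and `SentenceDetectorME` classes. It is not necessarily how the `SentenceDetection` class in this repo is written. The class name `SentenceDetectionSketch`, the default model path `models/en-sent.bin`, and the sample text are assumptions. It needs `opennlp-tools-1.9.3.jar` on the classpath.

```java
import java.io.FileInputStream;
import java.io.IOException;
import java.io.InputStream;

import opennlp.tools.sentdetect.SentenceDetectorME;
import opennlp.tools.sentdetect.SentenceModel;

// Hypothetical sketch, not the repo's SentenceDetection class.
class SentenceDetectionSketch {

    public static void main(String[] args) throws IOException {
        // Assumption: the model path is passed as the first argument,
        // otherwise a hypothetical default location is used.
        String modelPath = args.length > 0 ? args[0] : "models/en-sent.bin";

        try (InputStream in = new FileInputStream(modelPath)) {
            // Load the pre-trained sentence model from disk.
            SentenceModel model = new SentenceModel(in);
            SentenceDetectorME detector = new SentenceDetectorME(model);

            // Split the sample text into sentences and print one per line.
            String text = "Apache OpenNLP is a Java library. It processes natural language text.";
            for (String sentence : detector.sentDetect(text)) {
                System.out.println(sentence);
            }
        }
    }
}
```

The sentence model should match the language of the input text, so an English text is best paired with an English sentence model.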