Skip to content

A Hadoop MapReduce java implementation for disambiguating entities from simple English sentences using a phrasal knowledge base.

License

Notifications You must be signed in to change notification settings

VedantMahabaleshwarkar/phrasebase

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

21 Commits
 
 
 
 
 
 
 
 

Repository files navigation

Phrase Sense Disambiguation for NLP using PhraseBase

A Hadoop MapReduce java implementation for disambiguating entities from simple English sentences using a phrasal knowledge base.

Requirements

Steps

  • Clone the repository

  • Run in terminal hadoop com.sun.tools.javac.Main PhraseBase.java

  • Run jar cf PhraseBase.jar PhraseBase*.class

  • Create input.txt and write any simple sentence that has

    • subject
    • verb phrase
    • object

    eg. Sachin Tendulkar was born in Mumbai.

    You can provide multiple sentences in the input file but be sure to write each sentence on a different line and end them with a period (.)

  • Run hadoop jar PhraseBase.jar PhraseBase input.txt
    For more detailed theory and documentation about the project read the attached PDF

About

A Hadoop MapReduce java implementation for disambiguating entities from simple English sentences using a phrasal knowledge base.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Java 100.0%