nikhibdg/Brown-Clustering

Brown-Clustering

Description

  • Implementation of the Brown clustering algorithm
  • The algorithm trains on data and performs hierarchical clustering
  • Based on this clustering, it generates a unique vector for each word
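
The idea described above can be illustrated with a minimal sketch. It is not taken from brown_clustering.py: it assumes the textbook greedy variant, in which every word starts in its own cluster and the pair of clusters whose merge keeps the highest average mutual information between adjacent clusters is merged at each step. It also assumes that the "vector" of a word is its root-to-leaf bit path in the resulting merge tree. The function names (average_mutual_information, merge, brown_clusters, bit_paths) and the toy sentence are hypothetical, and the exhaustive pair search is only practical for tiny vocabularies.

```python
import itertools

import numpy as np


def average_mutual_information(counts):
    """Mutual information (in bits) between the left and right cluster of a bigram."""
    p = counts / counts.sum()
    left = p.sum(axis=1, keepdims=True)    # marginal of the left cluster
    right = p.sum(axis=0, keepdims=True)   # marginal of the right cluster
    expected = left @ right                # joint probability under independence
    mask = p > 0                           # skip empty cells to avoid log(0)
    return float((p[mask] * np.log2(p[mask] / expected[mask])).sum())


def merge(counts, i, j):
    """Return the bigram count matrix after folding cluster j into cluster i (i < j)."""
    merged = counts.copy()
    merged[i, :] += merged[j, :]
    merged[:, i] += merged[:, j]
    merged = np.delete(merged, j, axis=0)
    return np.delete(merged, j, axis=1)


def brown_clusters(tokens):
    """Greedily merge clusters and return the merge tree as nested tuples."""
    vocab = sorted(set(tokens))
    index = {word: k for k, word in enumerate(vocab)}
    counts = np.zeros((len(vocab), len(vocab)))
    for a, b in zip(tokens, tokens[1:]):
        counts[index[a], index[b]] += 1
    trees = list(vocab)  # one leaf per word to start with
    while len(trees) > 1:
        # Naive search: try every pair and keep the merge with the highest AMI.
        i, j = max(
            itertools.combinations(range(len(trees)), 2),
            key=lambda pair: average_mutual_information(merge(counts, *pair)),
        )
        counts = merge(counts, i, j)
        trees[i] = (trees[i], trees[j])
        del trees[j]
    return trees[0]


def bit_paths(tree, prefix=""):
    """Map each word to its root-to-leaf path: 0 for left branches, 1 for right."""
    if isinstance(tree, str):
        return {tree: prefix or "0"}
    left, right = tree
    paths = bit_paths(left, prefix + "0")
    paths.update(bit_paths(right, prefix + "1"))
    return paths


# Toy input, invented for illustration only.
tokens = "the cat sat on the mat the dog sat on the rug".split()
tree = brown_clusters(tokens)
paths = bit_paths(tree)
for word, bits in sorted(paths.items(), key=lambda item: item[1]):
    print(bits, word)
```

Because every word is a distinct leaf of the tree, each bit path is unique, and words that were merged early share a long common prefix. Truncating the paths to a fixed prefix length is one common way to obtain coarser clusters from the same tree.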

Usage

  • Dependencies: numpy and scipy

  • Running the code trains on a small subset of data named "subset_data.txt"

  • This dataset contains dummy POS tags

  • It clusters similar words, then prints and saves the clusters and the word vectors

    python3 brown_clustering.py 
    
