tam-borine/Hansard


Exploring Hansard Transcripts

The script so far:

  • Fetches and cleans all Hansard transcripts from 2017.
  • Computes the inverse document frequency of prominent words, excluding stopwords, with TfidfVectorizer.
  • Visualises the distribution and shows a few documents sorted by the words that define them most.

About

Text analysis on transcripts from parliament
