phi0959.phi015 has a very wrong number of words #185

Open
PonteIneptique opened this issue Dec 30, 2021 · 1 comment


@PonteIneptique
Member

Hi @AlisonBabeu :D

In

<mods:extent unit="words">
<mods:total>224765</mods:total>
</mods:extent>

More than 200k words are counted, but it's a poem of 474 lines: https://books.google.fr/books?id=I9QIAAAAQAAJ&pg=PA97&redir_esc=y&hl=fr#v=onepage&q&f=false
Might this have been a count based on the full book?

@AlisonBabeu
Contributor

Hi @PonteIneptique, this word count is not something that Perseus itself generated or counted. It was instead inherited from the PHI Canon, which states that this work has 224765 words. The XML file can be found here (https://github.com/PerseusDL/catalog_data/blob/master/perseus/wordcounts.xml), and all of the word counts for Latin works in the Perseus Catalog (that had a PHI ID) were added automatically. It is strange, because the text of the work in the PHI canon (https://latin.packhum.org/loc/959/15/0#0) seems to be nowhere near that long. I actually have no idea how these word counts were first generated by the PHI, but this is not the only work I've found where the word count seems to be way off. Hope this answers your question, and happy new year!
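Since this is apparently not the only work with a suspicious count, a rough sanity check might help find the others. The following is only a sketch, not anything Perseus or the PHI actually runs: it reads the `mods:total` value from a MODS fragment like the one quoted above and compares it with a known line count. The helper names and the 15 words-per-line threshold are assumptions made for illustration, and the line count has to come from somewhere else (here, the 474 lines mentioned in the issue).

```python
import xml.etree.ElementTree as ET

# Standard MODS v3 namespace
MODS_NS = {"mods": "http://www.loc.gov/mods/v3"}

# The fragment quoted in the issue, wrapped in a root element
# that declares the namespace so it parses on its own.
SAMPLE = """<mods:mods xmlns:mods="http://www.loc.gov/mods/v3">
  <mods:extent unit="words">
    <mods:total>224765</mods:total>
  </mods:extent>
</mods:mods>"""


def word_total(xml_text):
    """Return the word total from a MODS record, or None if absent."""
    root = ET.fromstring(xml_text)
    node = root.find(".//mods:extent[@unit='words']/mods:total", MODS_NS)
    if node is None or not node.text:
        return None
    return int(node.text)


def words_per_line(total_words, line_count):
    """Average number of words per line implied by the two counts."""
    return total_words / line_count


def looks_implausible(total_words, line_count, max_words_per_line=15):
    """Flag a count whose implied words-per-line exceeds the threshold.

    The default of 15 is a hypothetical cutoff for verse, not a
    value taken from Perseus or the PHI.
    """
    return words_per_line(total_words, line_count) > max_words_per_line


if __name__ == "__main__":
    total = word_total(SAMPLE)
    print(total, round(words_per_line(total, 474), 1))
    print(looks_implausible(total, 474))
```

For this work the implied average is several hundred words per line, which no verse line could hold, so the check flags it. A prose work would need a different comparison, since a line count is not meaningful there.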
