(tlg0385.tlg001.perseus-grc1.xml) EpiDoc and CTS conversion #1508

Open
lcerrato opened this issue Sep 19, 2023 · 2 comments


@lcerrato
Collaborator

lcerrato commented Sep 19, 2023

There are several issues with this file:

  1. It was split, with spurious (?) fragments removed; see "tlg0385 Cassius Dio questions" #150.

  2. The notes were not encoded properly, so Greek in the notes is still unconverted beta code. That would be manageable if there were at least tags around the Greek, but there aren't; see "(tlg0385.tlg001.perseus-grc1.xml) Beta code for Greek in footnotes" #550.

  3. The parentheses were not properly encoded, so the beta code to Unicode conversion reads them as beta code characters and produces bad Unicode.

So, in this passage:

<milestone unit="chapter" n="23"/><milestone unit="section" n="1"/> o( ou)=n *seroui/lios stratiw/tas te/ tinas e)s
*galati/an kata\ tu/xhn pario/ntas metepe/myato,
kai\ th\n boulh\n th=| par' au)tw=n froura=| sunagagw\n
proe/qhke gnw/mhn peri\ tw=n paro/ntwn, kai\ kurwqe/ntos
me\n mhdeno\s (dh/marxoi ga\r e)kw/lusan'
suggrafe/ntos de\ tou= do/cantos e)ke/leuse toi=s

the phrase (dh/marxoi ga\r e)kw/lusan', which is in parentheses in the attached image, is presently in the file as ʽδήμαρχοι γὰρ ἐκώλυσανʼ
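A minimal sketch of how such spans might be located for review, not the process actually run on this file. It assumes the converter turned the opening parenthesis into U+02BD and the closing mark into U+02BC; those code points, and the names `find_candidates` and `propose_fix`, are assumptions for illustration and should be checked against the file itself. Because U+02BC may also be a legitimate elision mark, the matches are only candidates for manual checking.

```python
import re

# Assumed code points for the mis-converted parentheses (verify in the file).
OPEN_MARK = "\u02bd"   # what an unescaped "(" appears to have become
CLOSE_MARK = "\u02bc"  # what the closing mark appears to have become

# An opening mark at the start of a word, then text containing no markup and
# no further marks, then a closing mark at the end of a word.
CANDIDATE = re.compile(
    r"(?<!\S)" + OPEN_MARK
    + "([^<>" + OPEN_MARK + CLOSE_MARK + "]+)"
    + CLOSE_MARK + r"(?!\w)"
)

def find_candidates(text):
    """Return the inner text of every span that looks like a lost parenthetical."""
    return [m.group(1) for m in CANDIDATE.finditer(text)]

def propose_fix(text):
    """Return the text with each candidate span wrapped in real parentheses."""
    return CANDIDATE.sub(lambda m: "(" + m.group(1) + ")", text)

sample = "μηδενὸς \u02bdδήμαρχοι γὰρ ἐκώλυσαν\u02bc\nσυγγραφέντος"
print(find_candidates(sample))
print(propose_fix(sample))
```

A parenthetical that itself contains an elision mark would not be matched by this pattern, so a review pass over the remaining U+02BD characters would still be needed.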

  4. There may be other such bad encoding issues where entities should have been used but were not.
  5. The section milestones are placed where they fall on the print line, not at the hard breaks.
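For the untagged Greek in the notes, one rough way to find candidates is to flag tokens that look like beta code. The following is a heuristic sketch only: the pattern and the name `flag_beta_code` are hypothetical, it will miss unaccented words, and it can misfire on ordinary text such as "and/or", so every hit would need manual review before tagging.

```python
import re

# A run of Latin letters (optionally preceded by the beta code capital marker "*")
# followed by at least one beta code diacritic character: ( ) / \ = | +
BETA_LIKE = re.compile(r"\*?[a-z]+[()/\\=|+]+[a-z()/\\=|+]*", re.IGNORECASE)

def flag_beta_code(note_text):
    """Return tokens in a note that look like unconverted beta code."""
    return BETA_LIKE.findall(note_text)

# Hypothetical note text, not taken from the file.
print(flag_beta_code("cf. tou\\s stratiw/tas in the MSS"))
```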
@AlisonBabeu
Collaborator

@lcerrato great googly moogly!

@lcerrato lcerrato self-assigned this Sep 26, 2024
@lcerrato
Collaborator Author

These problems remain, although I tried a few automated processes to address the parentheses. The notes and section breaks are still an issue.
