Email subscription scraper #29

Open
Mearnest opened this issue Jan 25, 2017 · 7 comments

Comments

@Mearnest
Collaborator

A really super awesome feature would be a scraper that generates meeting events from the meeting information in emails. These emails would come from email subscriptions to various board meetings and the like.

Unfortunately, the information is likely to be inside a Word or PDF document.

The motivation for this is to be able to keep the website up to date.

@Mearnest
Collaborator Author

Mearnest commented Jan 25, 2017

This can be in any programming language (even the ones below); it just needs to feed the Postgres database that Elixir reads from.

(insert-into postgres (scrape (email-attach word)) (scrape (email-attach pdf)))

Even Haskell.
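
For anyone picking this up, the first stage of the pseudocode above (pulling Word and PDF attachments out of a subscription email) could be sketched in Python with only the standard library. This is a rough sketch, not the project's implementation: `extract_attachments`, `build_sample`, and the `WANTED_TYPES` set are hypothetical names chosen for illustration, and actually parsing the Word or PDF contents would need a separate step that is not shown.

```python
from email import message_from_bytes, policy
from email.message import EmailMessage

# Assumption: these MIME types cover the Word and PDF attachments we care about.
WANTED_TYPES = {
    "application/pdf",
    "application/msword",
    "application/vnd.openxmlformats-officedocument.wordprocessingml.document",
}


def extract_attachments(raw_email: bytes):
    """Return (filename, content_type, payload_bytes) for each Word/PDF attachment."""
    msg = message_from_bytes(raw_email, policy=policy.default)
    found = []
    for part in msg.iter_attachments():
        if part.get_content_type() in WANTED_TYPES:
            found.append(
                (part.get_filename(), part.get_content_type(), part.get_payload(decode=True))
            )
    return found


def build_sample() -> bytes:
    """Build a made-up subscription email with one PDF and one text attachment."""
    msg = EmailMessage()
    msg["Subject"] = "Board meeting agenda"
    msg.set_content("Agenda attached.")
    msg.add_attachment(
        b"%PDF-1.4 fake", maintype="application", subtype="pdf", filename="agenda.pdf"
    )
    msg.add_attachment("just some notes", filename="notes.txt")
    return msg.as_bytes()


if __name__ == "__main__":
    for name, ctype, payload in extract_attachments(build_sample()):
        print(name, ctype, len(payload))
```

The text attachment is skipped because its type is not in `WANTED_TYPES`; only the PDF would be handed on to whatever does the document scraping.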

@aaronarduino
Collaborator

I'd be interested in making this. Would we use an email service like Mailgun?

@aaronarduino
Collaborator

Or maybe we could use something like https://context.io/docs/lite?

@Mearnest
Collaborator Author

Mearnest commented Jan 27, 2017

Whatever works best. Keep in mind that the resulting data has to be inserted into the meeting_types, meeting_dates, and meeting_extras tables in Postgres, unless it is easier to generate JSON or to provide Elixir with an API.
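
If the JSON route turns out to be easier, the output could be grouped by the three table names mentioned above. The sketch below is only an illustration: the field names (`name`, `starts_at`, `key`, `value`, and so on) are assumptions, since the real column layout of those tables is not described in this thread, and `to_payload` is a hypothetical helper.

```python
import json
from datetime import datetime


def to_payload(board_name: str, starts_at: datetime, location: str) -> dict:
    """Group one scraped meeting under the three table names.

    All field names inside each record are assumed, not taken from the real schema.
    """
    return {
        "meeting_types": [{"name": board_name}],
        "meeting_dates": [
            {"meeting_type": board_name, "starts_at": starts_at.isoformat()}
        ],
        "meeting_extras": [
            {"meeting_type": board_name, "key": "location", "value": location}
        ],
    }


if __name__ == "__main__":
    payload = to_payload("School Board", datetime(2017, 2, 1, 18, 30), "City Hall")
    print(json.dumps(payload, indent=2))
```

Keeping the top-level keys identical to the table names would let the Elixir side map each list straight onto an insert, whatever the final columns end up being.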

@Mearnest
Collaborator Author

You pretty much have total freedom to use any tech here. It just needs to run on the same server, unless Elixir is talking to an API running somewhere else.

@aaronarduino aaronarduino self-assigned this Jan 27, 2017
@aaronarduino
Collaborator

I've been working on this lately; see the link for progress: https://github.com/aaronarduino/public-meetings/tree/proto-email-scraper

@Mearnest
Collaborator Author

Mearnest commented Feb 1, 2017

Looking good so far!
