Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add different variations of PhAlkEthOH datasets #216

Open
chrisiacovella opened this issue Aug 2, 2024 · 1 comment
Open

Add different variations of PhAlkEthOH datasets #216

chrisiacovella opened this issue Aug 2, 2024 · 1 comment
Assignees
Labels
enhancement New feature or request

Comments

@chrisiacovella
Copy link
Member

Currently the PhAlkEthOh dataset comes from the OpenFF optimization dataset and contains the entire optimization trajectory for each unique molecule.

It would be good to have a few additional variations of this dataset for exploring various aspects of the different NNPs and how data generation strategies impact efficacy. A few additional "versions" to add for the existing dataset:

  • The first (rdkit generated) configuration
  • The first (rdkit generated) and last (energy minimized) configurations in the trajectory.
  • The first few configurations in the trajectory; this would replicate the idea of just doing a few steps of optimization.

Related, I will work on getting additional calculations going using the trajectories generated with GAFF.

@chrisiacovella
Copy link
Member Author

This was mostly addressed in PR #245 . This PR removed configurations with high forces (about 1 hatree/bohr, just like done in spice). This also generates a test/full dataset that only contain the final energy minimized configuration, to make something that is very similar to qm9 (but with forces). This can serve as a baseline for also seeing importance of a few steps of optimization, MD generated configurations, etc.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

1 participant