Config lefthook #25

SalihuDickson · 2024-08-24T22:46:49Z

Summary by Sourcery

Refactor the TES and WES converters to use Pydantic for data validation, improving data integrity and error handling. Introduce new Pydantic models for TES and WES data. Enhance the CI workflow to run tests conditionally and trigger on all branches. Add unit tests for WRROC models and validators. Configure pre-push hooks using Lefthook for code linting with Ruff.

New Features:

Introduce Pydantic models for TES and WES data validation, enhancing data integrity and error handling in the conversion process.
Add a new lefthook.yml configuration file to set up pre-push hooks for code linting with Ruff.

Enhancements:

Refactor TES and WES converters to use Pydantic for data validation, improving code reliability and maintainability.
Update the CLI to use double quotes for consistency in option definitions and improve readability.
Enhance the CI workflow to run tests automatically and ensure code quality by adding a condition to run tests only if previous steps succeed.

CI:

Modify the CI workflow to trigger on all branches instead of just the main branch, allowing for more comprehensive testing across different development branches.

Tests:

Add unit tests for WRROC models and validators to ensure correct functionality and data validation.

This reverts commit f64ba49.

This reverts commit db07662.

This reverts commit f64ba49.

sourcery-ai · 2024-08-24T22:46:56Z

Reviewer's Guide by Sourcery

This pull request introduces significant changes to the project structure and functionality, focusing on improving data validation, error handling, and code organization. The changes include the addition of new models for WRROC, TES, and WES data structures, implementation of validators, updates to existing converters, and the introduction of a pre-push hook for code linting.

File-Level Changes

Change	Details	Files
Implemented Pydantic models for data validation	Created WRROC models (WRROCProcess, WRROCWorkflow, WRROCProvenance) Added TES models for task execution data Introduced WES models for workflow execution data	`crategen/models/wrroc_models.py` `crategen/models/tes_models.py` `crategen/models/wes_models.py`
Updated converters to use new Pydantic models	Refactored TESConverter to use TESData model for validation Updated WESConverter to use WESData model for validation Improved error handling in converters	`crategen/converters/tes_converter.py` `crategen/converters/wes_converter.py`
Introduced validators for WRROC data	Implemented validate_wrroc function to determine WRROC profile Added validate_wrroc_tes for TES-specific validation Created validate_wrroc_wes for WES-specific validation	`crategen/validators.py`
Updated CI workflow and added pre-push hook	Modified CI workflow to run on all branches Added Ruff linter check to CI process Implemented pre-push hook using Lefthook for Ruff linting	`.github/workflows/ci.yml` `lefthook.yml`
Refactored and improved existing code	Updated CLI implementation for better error handling Improved utility functions for date/time conversions Refactored AbstractConverter for consistency	`crategen/cli.py` `crategen/utils.py` `crategen/converters/abstract_converter.py`
Added unit tests for new functionality	Created unit tests for WRROC models and validators	`tests/unit/test_wrroc_models.py`

Tips

Trigger a new Sourcery review by commenting @sourcery-ai review on the pull request.
Continue your discussion with Sourcery by replying directly to review comments.
You can change your review settings at any time by accessing your dashboard:
- Enable or disable the Sourcery-generated pull request summary or reviewer's guide;
- Change the review language;
You can always contact us if you have any questions or feedback.

sourcery-ai

Hey @SalihuDickson - I've reviewed your changes and they look great!

Here's what I looked at during the review

🟡 General issues: 1 issue found
🟢 Security: all looks good
🟡 Testing: 1 issue found
🟢 Complexity: all looks good
🟢 Documentation: all looks good

Sourcery is free for open source - if you like our reviews please consider sharing them ✨

_{Help me be more useful! Please click 👍 or 👎 on each comment to tell me if it was helpful.}

sourcery-ai · 2024-08-24T22:48:08Z

crategen/converters/tes_converter.py

+            volumes,
+            logs,
+            tags,
+        ) = validated_tes_data.dict().values()


suggestion: Use a more robust method for extracting validated data

Using .dict().values() is potentially fragile if the order of fields in the Pydantic model changes. Consider using named attributes instead, e.g., validated_tes_data.id, validated_tes_data.name, etc. This would make the code more robust and easier to understand.

Suggested change

) = validated_tes_data.dict().values()

id = validated_tes_data.id

name = validated_tes_data.name

description = validated_tes_data.description

executors = validated_tes_data.executors

inputs = validated_tes_data.inputs

outputs = validated_tes_data.outputs

volumes = validated_tes_data.volumes

logs = validated_tes_data.logs

tags = validated_tes_data.tags

sourcery-ai · 2024-08-24T22:48:08Z

tests/unit/test_wrroc_models.py

+    def test_validate_wrroc_tes_missing_fields(self):
+        """
+        Test that validate_wrroc_tes raises a ValueError if required fields for TES conversion are missing.
+        """
+        data = {"id": "process-id", "name": "Test Process"}
+        with self.assertRaises(ValueError):
+            validate_wrroc_tes(data)


suggestion (testing): Consider adding more specific test cases for TES validation

While this test checks for missing fields, it might be beneficial to add more specific test cases that check each required field individually. This could help pinpoint exactly which field validations might fail in the future.

Suggested change

def test_validate_wrroc_tes_missing_fields(self):

"""

Test that validate_wrroc_tes raises a ValueError if required fields for TES conversion are missing.

"""

data = {"id": "process-id", "name": "Test Process"}

with self.assertRaises(ValueError):

validate_wrroc_tes(data)

def test_validate_wrroc_tes_missing_fields(self):

"""Test validate_wrroc_tes raises ValueError for missing required fields."""

required_fields = ['id', 'name', 'description', 'executors']

for field in required_fields:

data = {f: "value" for f in required_fields if f != field}

with self.subTest(f"Missing {field}"):

with self.assertRaises(ValueError):

validate_wrroc_tes(data)

sourcery-ai · 2024-08-24T22:48:08Z

crategen/converters/tes_converter.py

+            logs,
+            tags,
+        ) = validated_tes_data.dict().values()
+        end_time = validated_tes_data.logs[0].end_time

        # Convert to WRROC
        wrroc_data = {


issue (code-quality): Inline variable that is immediately returned (inline-immediately-returned-variable)

sourcery-ai · 2024-08-24T22:48:08Z

crategen/converters/tes_converter.py

-        start_time = wrroc_data.get("startTime", "")
-        end_time = wrroc_data.get("endTime", "")
+    def convert_from_wrroc(self, data):
+        # Validate WRROC data


issue (code-quality): We've found these issues:

Inline variable that is immediately returned (inline-immediately-returned-variable)

Don't assign to builtin variable id (avoid-builtin-shadow)

Explanation

Python has a number of builtin variables: functions and constants that
form a part of the language, such as list, getattr, and type
(See https://docs.python.org/3/library/functions.html).
It is valid, in the language, to re-bind such variables:

list = [1, 2, 3]

However, this is considered poor practice.

It will confuse other developers.

It will confuse syntax highlighters and linters.

It means you can no longer use that builtin for its original purpose.

How can you solve this?

Rename the variable something more specific, such as integers.
In a pinch, my_list and similar names are colloquially-recognized
placeholders.

sourcery-ai · 2024-08-24T22:48:08Z

crategen/converters/wes_converter.py

        wes_data = {
            "run_id": run_id,
-            "run_log": {
-                "name": name,
-                "start_time": start_time,
-                "end_time": end_time,
-            },
+            "run_log": {"name": name, "start_time": start_time, "end_time": end_time},
            "state": state,
-            "outputs": [{"location": res.get("@id", ""), "name": res.get("name", "")} for res in result_data],
+            "outputs": [{"location": res.id, "name": res.name} for res in result_data],
        }


issue (code-quality): Inline variable that is immediately returned (inline-immediately-returned-variable)

sourcery-ai · 2024-08-24T22:48:08Z

crategen/models/tes_models.py

+
+        if content_is_set:
+            values["url"] = None
+        elif not content_is_set and not url_is_set:


suggestion (code-quality): Remove redundant conditional (remove-redundant-if)

Suggested change

elif not content_is_set and not url_is_set:

elif not url_is_set:

sourcery-ai · 2024-08-24T22:48:09Z

crategen/validators.py

+    missing_fields = [
+        field for field in required_fields if getattr(validated_data, field) is None
+    ]
+
+    if missing_fields:


suggestion (code-quality): Use named expression to simplify assignment and conditional (use-named-expression)

Suggested change

missing_fields = [

field for field in required_fields if getattr(validated_data, field) is None

]

if missing_fields:

if missing_fields := [

field

for field in required_fields

if getattr(validated_data, field) is None

]:

sourcery-ai · 2024-08-24T22:48:09Z

crategen/validators.py

+    missing_fields = [
+        field for field in required_fields if getattr(validated_data, field) is None
+    ]
+
+    if missing_fields:


suggestion (code-quality): Use named expression to simplify assignment and conditional (use-named-expression)

Suggested change

missing_fields = [

field for field in required_fields if getattr(validated_data, field) is None

]

if missing_fields:

if missing_fields := [

field

for field in required_fields

if getattr(validated_data, field) is None

]:

salihuDickson added 20 commits August 21, 2024 22:52

improve tes models

db0fc0d

update lint config and fix lint issue

775e72f

block push if lint fails

4c82117

block push if lint fails

9724ccf

configure lefthook

f64ba49

configure lefthook

db07662

Revert "configure lefthook"

0b4b94b

This reverts commit f64ba49.

Revert "configure lefthook"

17cee80

This reverts commit db07662.

Revert "configure lefthook"

54a4d31

This reverts commit f64ba49.

configure lefthook

d7bf79f

configure lefthook

8d6f81a

lint code

25eb0c5

lint code

1697a12

lint code

600134c

add utils.py

d400654

lint code

db31c39

remove tes models from default models file

662adb1

separate models into different files

4157c5e

lint code

fae0a06

fix test imports

c5be79a

sourcery-ai bot reviewed Aug 24, 2024

View reviewed changes

SalihuDickson changed the base branch from main to models August 24, 2024 22:58

salihuDickson added 2 commits August 25, 2024 00:31

Merge remote-tracking branch 'origin/models' into config-lefthook

420bbe6

lint

4830de7

SalihuDickson merged commit 3dc009a into models Aug 24, 2024
2 checks passed

SalihuDickson deleted the config-lefthook branch August 24, 2024 23:34

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Config lefthook #25

Config lefthook #25

SalihuDickson commented Aug 24, 2024 •

edited by sourcery-ai bot

Loading

sourcery-ai bot commented Aug 24, 2024 •

edited

Loading

sourcery-ai bot left a comment

sourcery-ai bot Aug 24, 2024

sourcery-ai bot Aug 24, 2024

sourcery-ai bot Aug 24, 2024

sourcery-ai bot Aug 24, 2024

sourcery-ai bot Aug 24, 2024

sourcery-ai bot Aug 24, 2024

sourcery-ai bot Aug 24, 2024

sourcery-ai bot Aug 24, 2024

-        ) = validated_tes_data.dict().values()
+        id = validated_tes_data.id
+        name = validated_tes_data.name
+        description = validated_tes_data.description
+        executors = validated_tes_data.executors
+        inputs = validated_tes_data.inputs
+        outputs = validated_tes_data.outputs
+        volumes = validated_tes_data.volumes
+        logs = validated_tes_data.logs
+        tags = validated_tes_data.tags

	elif not content_is_set and not url_is_set:
	elif not url_is_set:

Config lefthook #25

Config lefthook #25

Conversation

SalihuDickson commented Aug 24, 2024 • edited by sourcery-ai bot Loading

Summary by Sourcery

sourcery-ai bot commented Aug 24, 2024 • edited Loading

Reviewer's Guide by Sourcery

File-Level Changes

sourcery-ai bot left a comment

Choose a reason for hiding this comment

sourcery-ai bot Aug 24, 2024

Choose a reason for hiding this comment

sourcery-ai bot Aug 24, 2024

Choose a reason for hiding this comment

sourcery-ai bot Aug 24, 2024

Choose a reason for hiding this comment

sourcery-ai bot Aug 24, 2024

Choose a reason for hiding this comment

sourcery-ai bot Aug 24, 2024

Choose a reason for hiding this comment

sourcery-ai bot Aug 24, 2024

Choose a reason for hiding this comment

sourcery-ai bot Aug 24, 2024

Choose a reason for hiding this comment

sourcery-ai bot Aug 24, 2024

Choose a reason for hiding this comment

SalihuDickson commented Aug 24, 2024 •

edited by sourcery-ai bot

Loading

sourcery-ai bot commented Aug 24, 2024 •

edited

Loading