Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

feat(build): changes to decrease build time, cancel runs in case of multiple commits #5187

Merged
merged 19 commits into from
Jun 17, 2022

Conversation

anshbansal
Copy link
Collaborator

Checklist

  • The PR conforms to DataHub's Contributing Guideline (particularly Commit Message Format)
  • Links to related issues (if applicable)
  • Tests for the changes have been added/updated (if applicable)
  • Docs related to the changes have been added/updated (if applicable). If a new feature has been added a Usage Guide has been added for the same.
  • For any breaking change/potential downtime/deprecation/big changes an entry has been made in Updating DataHub

@github-actions
Copy link

github-actions bot commented Jun 16, 2022

Unit Test Results (build & test)

382 tests  ±0   382 ✔️ ±0   9m 15s ⏱️ +13s
  89 suites ±0       0 💤 ±0 
  89 files   ±0       0 ±0 

Results for commit c756555. ± Comparison against base commit b0db17e.

♻️ This comment has been updated with latest results.

@github-actions
Copy link

github-actions bot commented Jun 16, 2022

Unit Test Results (metadata ingestion)

       6 files  +    1         6 suites  +1   1h 13m 12s ⏱️ - 28m 58s
   556 tests ±    0     553 ✔️ ±    0    3 💤 ±0  0 ±0 
1 992 runs   - 565  1 920 ✔️  - 562  72 💤  - 3  0 ±0 

Results for commit c756555. ± Comparison against base commit b0db17e.

♻️ This comment has been updated with latest results.

@anshbansal anshbansal force-pushed the parallelization-tests branch from 643bfb8 to b6e71d1 Compare June 17, 2022 07:20
@anshbansal anshbansal changed the title [WIP] Parallelization tests feat(build): changes to decrease build time, cancel runs in case of multiple commits Jun 17, 2022
Copy link
Contributor

@treff7es treff7es left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

lgtm

@@ -321,7 +321,6 @@ def get_long_description():
"pytest-asyncio>=0.16.0",
"pytest-cov>=2.8.1",
"pytest-docker>=0.10.3,<0.12",
"tox",
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Why did you remove tox?

Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Also interested to know!

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

tried parallelising via tox but there was some problem during builds. the build wheel being generated by the 3 test suites was stepping on each other's toes and causing at least one of them to fail. Also, thought it might be better to do it via github actions so we can easily parallelise and isolate. Now if that SAP HANA test flakes we can simply re-run that (that and nifi one under slow integration) one instead of all integration tests.

@anshbansal anshbansal merged commit 4541379 into datahub-project:master Jun 17, 2022
@anshbansal anshbansal deleted the parallelization-tests branch June 17, 2022 12:35
@@ -15,6 +15,10 @@ on:
release:
types: [published, edited]

concurrency:
group: ${{ github.workflow }}-${{ github.event.pull_request.number || github.ref }}
cancel-in-progress: true
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This means cancelling only those in progress for the same PR right?

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

For the same branch

@@ -24,6 +25,11 @@
logger = logging.getLogger(__name__)


telemetry_enabled = (
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Why is this guy in here?

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Because in case we had file sink this was causing telemetry client id to fetched. In case telemetry is disabled it should not be doing that.

For unit tests this was doing 3 retries and failing. Adding this and passing the environment variable reduced time for unit tests by 30-60 secs.

alexey-kravtsov pushed a commit to infobip/datahub that referenced this pull request Jul 8, 2022
maggiehays pushed a commit to maggiehays/datahub that referenced this pull request Aug 1, 2022
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants