Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

build: Check to see if npm dependencies are already up to date; #2801

Merged
merged 9 commits into from
Jun 13, 2023
Merged

build: Check to see if npm dependencies are already up to date; #2801

merged 9 commits into from
Jun 13, 2023

Conversation

richb-hanover
Copy link
Contributor

New task "check-npm-cache" that avoids unnecessary long startup times.
Save a .npmtimestamp file in the node_modules directory after npm install. If a subsequent call to check-npm-cache shows the package.json is older, then
there's no need to re-run npm install...

richb-hanover and others added 2 commits June 11, 2023 21:10
only issue npm install --install-links=false if package.json has been modified
   since last time.
@max-sixty max-sixty changed the title build: Check to see if npm depencies are already up to date; build: Check to see if npm dependencies are already up to date; Jun 12, 2023
Taskfile.yml Outdated
# symlink. Refer to https://github.com/PRQL/prql/pull/1296.
desc: Check to see if package.json has changed since we ran npm install
dir: web/playground
status: # if both are true, don't rebuild
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Probably. My concern is that the node_modules folder has ~50,000 files. Computing a checksum over all those files might take nearly as much time as the npm install I'll check thetimestamp method to see how it performs.

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Can we use it on package.json though? Like we're doing here?

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Remarkably, the checksum option is fast - less than a second. It's also indistinguishable from the timestamp method. So this commit switches to using checksums.

My test process was to modify package.json (add a blank line) and run task run-playground which calls npm install ...

I then ^C, and re-ran task run-playground and the Task reports, task: Task "check-npm-cache" is up to date

Can you suggest any other tests? Thanks.

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I'm actually not sure if it's using the files under generates; I couldn't completely decipher the docs:

For the checksum (default) or timestamp method to work, it is only necessary to inform the source files. When the timestamp method is used, the last time of the running the task is considered as a generate.

Taskfile.yml Outdated
- npm start
#

check-npm-cache:
Copy link
Member

@max-sixty max-sixty Jun 12, 2023

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Overall this is great, very nice addition.

Can we rename this to something like install-playground? It's doing more than just checking the cache.

Then let's merge! (edit: one note added below)

@max-sixty
Copy link
Member

Actually one question — do we need to reinstall when rust files in prql-compiler change? Will this forget to pick that up?

Even if that is the case, we can still have a task like run-playground-cached which does this.

@richb-hanover
Copy link
Contributor Author

re: renaming - a good idea

re: prql-compilerchanges - I bet that would be important. It complicates the sources/generates statements, since they're not in the same directory. (At least my simple-minded first cut of adding a test for prql-compiler/src/* didn't work.)

I'll keep bulling ahead...

@max-sixty
Copy link
Member

No prob — why don't we merge this with something like run-playground-cached, add a desc: field, and then you're not waiting around?

And then we can refactor this all when we upgrade rust and can do this as part of the standard build, which is cached itself.

@richb-hanover
Copy link
Contributor Author

I went with a task name of "install-playground-npm-dependencies".

Then we need to figure out how to trigger the npm install when prql-compiler sources change as well. Thanks.

# install prql-js as the regular dependency, instead of creating a
# symlink. Refer to https://github.com/PRQL/prql/pull/1296.
- npm install --install-links=false
- task: install-playground-npm-dependencies
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I was thinking this wouldn't be the default under run-playground — there would be a separate task run-playground-npm-dependencies

The default needs to pick up the full path, otherwise it would be v confusing that it's using old rust code.

I'm not sure it worth investing much in us detecting the rust changes, since it might be difficult and it will be replaced by the proper solution when we can do #1836

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Hmmm... This PR is meant as a convenience for people doing documentation. The current main branch always runs the full npm install (taking ~1 minute), for both task run-web and task run-playground, even if I ^C the one, start the other, with no changes to any of the source files.

On further inspection, I see there's a task run-website that does what I want. It simply serves out the (previously-built) static site. I have created a similar task run-pg that people can use to avoid the npm step completely.

I believe it's useful to keep the separate install-playground-npm-dependencies task because it factors out the (unexpected) --install-links option into a single function.

Potential To-do's:

  1. Detect changes to prql-compiler source file in install-playground-npm-dependencies
  2. Regularize the task names
    • build-web - builds the website (the current build-web)
    • build-run-web - builds and runs the website (the current run-web)
    • run-web - run previously-built static site (the current run-website)
    • similar names for build-playground, build-run-playground, run-playground
    • similar names for run-book, just for completeness

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

(responding quickly, forgive me for brevity!)

  • build-web - builds the website (the current build-web)

    • build-run-web - builds and runs the website (the current run-web)

    • run-web - run previously-built static site (the current run-website)

I've been referring to web as all three — book / playground / website — and website as just the static site (at prql-lang.org). Does that change your view at all?

Detect changes to prql-compiler source file in install-playground-npm-dependencies

As I say above — if you want to do this then you're welcome to, but it'll go away shortly when #1836 is unblocked (by Debian rust version...)

On further inspection, I see there's a task run-website that does what I want. It simply serves out the (previously-built) static site.

The run- steps also build things IIUC. But yes OK; we could have run- skip the building and have build-run do both? (Though, again, soon building will be very cheap, so it might be a smaller change to just add your new task as a suffix of -cached and then we refactor it out when we do #1836?)

@@ -300,6 +300,12 @@ tasks:
- task: install-playground-npm-dependencies
- npm start

run-pg:
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This is OK! I would have voted for something clearer — not obvious how pg is different from playground (though it is elegant in a way! Smaller and less descriptive, just like running vs building!!)

@max-sixty
Copy link
Member

If it's OK, I'm going to merge, and then we do any follow-ups in a separate PR?

@max-sixty max-sixty merged commit d614a52 into PRQL:main Jun 13, 2023
@richb-hanover
Copy link
Contributor Author

@max-sixty This is great. I'll be delighted when building is cheap. In the mean time, I don't have to be grumpy waiting for npm install every. single. time. I. run. :-) Thanks.

@richb-hanover richb-hanover deleted the npm-cache-task branch June 13, 2023 19:06
max-sixty added a commit to max-sixty/prql that referenced this pull request Jun 17, 2023
I hadn't realized that PRQL#2801 stopped building the rust code by default. This changes it slightly so running `run-playground-cached` avoids building the rust code.

It removes `run-pg`, since the JS build without the rust build takes just a few seconds
max-sixty added a commit to max-sixty/prql that referenced this pull request Jun 20, 2023
I hadn't realized that PRQL#2801 stopped building the rust code by default. This changes it slightly so running `run-playground-cached` avoids building the rust code.

It removes `run-pg`, since the JS build without the rust build takes just a few seconds
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants