Implement loadtxt and savetxt #23

certik · 2019-12-19T18:24:15Z

This also includes a minimal CMake build system. We can improve the
build system in further PRs.

Fixes #16.

The Makefile is moved to Makefile.manual so that it does not conflict with CMake-generated makefiles. To use it, execute make with: make -f Makefile.manual

milancurcic · 2019-12-19T18:37:34Z

Thanks @certik. Should feature PRs go into feature branches, for example feature/loadtxt?

certik · 2019-12-19T19:27:02Z

My own preference is to simply send PRs against master. That seems to scale really well even for big projects and it's simple for newcomers to understand. But if others feel we should use another workflow, I am happy to adjust. How would the feature branch work? Would it be long lived?

What I was thinking is simply merging to master and using master as the latest version, so that people can easily test it out.

However, one thing I am not sure about is whether we should have an "experimental" section, perhaps src/experimental, and put new features there first. The module would be called stdlib_x_io, or something like that, to show that it is experimental (i.e. freshly implemented) and that we need to gain experience actually using it before deciding whether we are willing to commit to maintaining a backwards-compatible API forever.

jvdp1 · 2019-12-19T19:39:51Z

> This also includes a minimal CMake build system. We can improve the
> build system in further PRs.
>
> Fixes #16.

Nice.
Regarding extension to types other than dp (e.g. float, integer4/8, ...), would it be better to use data polymorphism (unlimited polymorphic) or to repeat the procedures and overload them (maybe a question for the issues)?

certik · 2019-12-19T19:41:41Z

@jvdp1 good point. I think double precision is the most common, so we can start with that. We either have to repeat the code by hand, or use a templating system (there are a few) that generates it for us.
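The "repeat by hand and overload" option mentioned above could look roughly like the following. This is only a minimal sketch, not the code in this PR: the module name overload_sketch, the specific names ssavetxt and dsavetxt, and the output format are assumptions made for illustration. The kinds come from iso_fortran_env.

```fortran
module overload_sketch
    use iso_fortran_env, only: sp => real32, dp => real64
    implicit none
    private
    public :: savetxt

    ! One generic name; the compiler selects the specific procedure
    ! from the kind of the array argument.
    interface savetxt
        module procedure ssavetxt
        module procedure dsavetxt
    end interface

contains

    subroutine ssavetxt(filename, d)
        ! Single precision variant (hypothetical name).
        character(len=*), intent(in) :: filename
        real(sp), intent(in) :: d(:,:)
        integer :: u, i
        open(newunit=u, file=filename, status="replace", action="write")
        do i = 1, size(d, 1)
            write(u, "(*(g0,1x))") d(i, :)   ! one matrix row per line
        end do
        close(u)
    end subroutine ssavetxt

    subroutine dsavetxt(filename, d)
        ! Double precision variant: same body, only the kind differs.
        character(len=*), intent(in) :: filename
        real(dp), intent(in) :: d(:,:)
        integer :: u, i
        open(newunit=u, file=filename, status="replace", action="write")
        do i = 1, size(d, 1)
            write(u, "(*(g0,1x))") d(i, :)   ! one matrix row per line
        end do
        close(u)
    end subroutine dsavetxt

end module overload_sketch
```

The duplicated bodies are the cost of this approach; a templating system would generate them from a single source instead.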

milancurcic · 2019-12-19T20:01:55Z

> However, one thing I am not sure about is whether we should have an "experimental" section, perhaps src/experimental, and put new features there first. The module would be called stdlib_x_io, or something like that, to show that it is experimental (i.e. freshly implemented) and that we need to gain experience actually using it before deciding whether we are willing to commit to maintaining a backwards-compatible API forever.

Could a git branch (devel or even experimental) be used for this purpose, like @zbeekman suggested in #5?

jvdp1 · 2019-12-19T20:07:40Z

@certik I often use single-precision ;) This could also be an issue for other modules (e.g. linalg?), so it might be good to discuss it more broadly at some point. I have never worked with templating systems, but I am not against them. It would probably be more efficient (easier?) than using unlimited polymorphism.

certik · 2019-12-19T20:12:53Z

@milancurcic it can. But things can stay experimental for quite some time (I can easily see something being experimental for a year). And managing two branches becomes painful. For example, with this PR, it requires some infrastructure setup (CMake, etc.), so if we put it in an experimental branch, we have to redo the CMake setup in another PR that also needs it. We can extract the CMake stuff, put it into master, and try to keep the experimental branch small. But it's still going to be quite some work. Then how about releases? If everything is in master, we just need to create one release tarball and everybody can test it out. If we have an experimental branch, do we release two tarballs? We can, but it feels like a lot of administrative overhead.

I was thinking of doing something like C++ does:

https://en.cppreference.com/w/cpp/experimental

As an example: https://en.cppreference.com/w/cpp/experimental/parallelism_2, the experimental (new) feature is simply in an experimental/... header file, but my understanding is that it is part of the main standard library (e.g., part of "master").

certik · 2019-12-19T20:18:36Z

I would suggest developing the way Microsoft develops the C++ standard library. Here is their main repo:

https://github.com/microsoft/STL

Only one branch (master). The experimental features are in master, in the experimental directory, e.g.:

https://github.com/microsoft/STL/blob/28ec9a32952e0d7443936f8d5ae5d675ba6cf65c/stl/inc/experimental/deque

Here is an example of a PR, against master, with an experimental feature:

microsoft/STL#361

It's simple, it's proven, it works for C++ and Microsoft. And then if we need to make some adjustment to a proven workflow, we can.

certik · 2019-12-19T20:19:36Z

@jvdp1 good point, we should also implement a single precision version in this PR.

milancurcic · 2019-12-19T20:29:19Z

@certik Got it, I didn't consider all the separate infrastructure that would be needed, and indeed that would be a pain in a separate branch. Having a separate repo for this would also be a pain, I think. A separate directory, as you suggest, seems reasonable.

certik · 2019-12-19T21:59:39Z

Ok, so let's just use master with a separate directory called experimental, as in the C++ stdlib. How should we rename the module? In C++, you import with #include <experimental/deque>, so in your code you know you are using an experimental API. In Fortran, we do not import using the path, so we need to rename the module (until j3-fortran/fortran_proposals#86 is implemented). Here are some ideas for the new name:

stdlib_x_io
x_stdlib_io
stdlib_io_x
stdlib_experimental_io

The last one, stdlib_experimental_io, is probably most in line with the C++ idea, i.e. #include <experimental/deque> would correspond to stdlib_experimental_deque. Also, spelling out experimental in full would be consistent with the name of the directory.

milancurcic · 2019-12-19T22:03:21Z

src/stdlib_io.f90

+logical function whitechar(char) ! white character
+! returns .true. if char is space (32) or tab (9), .false. otherwise
+character, intent(in) :: char
+if (iachar(char) == 32 .or. iachar(char) == 9) then


stdlib_ascii module will be useful here to not use literal constants. Minor nitpick as ascii constants won't change any time soon, but nevertheless.

certik · 2019-12-19

I intentionally didn't expose whitechar as public, as we might want to change the API. Once we implement stdlib_string we can put all these in it and polish it up.

ivan-pi · 2019-12-19

We have a "circular dependency" here. I would like to submit a pull request for stdlib_ascii, but I was waiting to have some CMake machinery set up so that I could use assert in the unit tests.

ivan-pi · 2019-12-21

> stdlib_ascii module will be useful here to not use literal constants. Minor nitpick as ascii constants won't change any time soon, but nevertheless.

Internally in stdlib_ascii I am also using both literal character and hexadecimal constants for the symbols in the ascii table. I see no other portable way. Of course compiler vendors targeting specific processors with other default collating sequences could implement their own low-level versions. I guess another option would be to hack something up using the transfer intrinsic and bit-mask operations, but I see no benefit.

Edit: I probably misunderstood your comment, which was suggesting something like char == ascii_tab .or. char == ascii_space instead of cryptic ASCII code integers. The stdlib_ascii module will have an is_blank function, which can be used instead of whitechar.
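For illustration, the named-constant version of the whitespace check discussed in this thread might look like the sketch below. The constant names ascii_tab and ascii_space and the module name whitechar_sketch are assumptions; the actual stdlib_ascii module may define them differently.

```fortran
module whitechar_sketch
    implicit none
    private
    public :: is_blank

    ! Hypothetical named constants standing in for the ones that a
    ! stdlib_ascii module could provide.
    character(len=1), parameter :: ascii_tab   = achar(9)
    character(len=1), parameter :: ascii_space = achar(32)

contains

    pure logical function is_blank(c)
        ! Returns .true. if c is a space or a tab, .false. otherwise.
        character(len=1), intent(in) :: c
        is_blank = (c == ascii_space .or. c == ascii_tab)
    end function is_blank

end module whitechar_sketch
```

The comparison reads the same as the original iachar(char) == 32 .or. iachar(char) == 9 test, but the intent is visible without consulting an ASCII table.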

certik · 2019-12-20T18:13:32Z

Let's keep the discussion going, so that we can merge this.

Should I move the io module into: stdlib_experimental_io per the discussion above?

certik · 2019-12-20T23:03:28Z

@milancurcic please let me know your opinion regarding stdlib_experimental_io, see above.

certik · 2019-12-21T04:59:33Z

I moved all new code to stdlib_experimental in 559bfd7. It feels right, because now we don't need to get everything 100% right in each PR. We just need to get it mostly right, and then we can improve the rest collaboratively with subsequent PRs, and get some real-world usage, until we are all convinced that the functionality is rock solid. Then we can move it to stdlib_io from experimental.

certik · 2019-12-21T05:09:47Z

I also just added a single precision version.

@milancurcic, @marshallward, @jvdp1 would you mind giving it another review please?

jvdp1 · 2019-12-21T09:06:35Z

src/stdlib_experimental_io.f90

+real(dp), allocatable :: tmp(:,:)
+call dloadtxt(filename, tmp)


* This implies an additional copy of the array d (in dp). This could be quite inefficient for large files/arrays.
* This approach could also be difficult to generalize to quad precision (qp).

I implemented a more general solution and extended it to qp. How could I propose these changes?

certik · 2019-12-21T13:25:40Z

Send a PR against my branch in my fork. When I merge it, your commits will appear here.

certik · 2019-12-21T13:39:32Z

The alternative workflow is that we merge this PR, and you simply send a PR with improvements against master. In fact I think I would prefer that --- simpler and it scales better, as there are a lot more improvements that we need to make:

* you can work on better single and quadruple support
* I can work on setting up a CI
* somebody else can work on the ascii module, ...

@milancurcic would you be ok with merging this PR now so that others can send subsequent PRs?

jvdp1 · 2019-12-21T14:39:43Z

> The alternative workflow is that we merge this PR, and you simply send a PR with improvements against master. In fact I think I would prefer that --- simpler and it scales better, as there are a lot more improvements that we need to make:
>
> * you can work on better single and quadruple support
> * I can work on setting up a CI
> * somebody else can work on the ascii module, ...

@certik, @milancurcic At this point, I think it is indeed the easiest solution. @ivan-pi may also be waiting on a first implementation for the ascii module.

milancurcic · 2019-12-21T14:41:07Z

I like the idea of implementations going into the staging area like experimental and getting further work there before becoming part of the "stable" stdlib. stdlib_experimental_io is okay with me.

To move faster, I also agree that we can merge PRs into stdlib/experimental. There will be quite a few things that we'll want to fix or change, and this could keep the PR from being merged and slow down subsequent contributions that depend on it (but should really be their own PRs).

I think that we can merge PRs into stdlib/experimental as soon as:

* The community agrees on the API (that's why it is better to keep PRs small);
* Tests pass.

Basically, treat it as an MVP -- minimum viable product. Then we can refine with additional PRs, but other contributions that depend on this can be made because the code is merged in master.

I will give this another review now.

milancurcic · 2019-12-21T15:16:51Z

The only outstanding issue is that the test data (in src/tests/loadtxt/) are not copied or linked to where the test executables are built with CMake, if they're built in a separate directory. For example, my default habit was to do

mkdir build
cd build
cmake ..
make
ctest

In which case the tests fail because test data is not there.

We should somehow either ensure that test data is next to the test executables, or put an instruction in README.md that the library must be built in the top-level directory.

I will have a few suggestions for changes after merge, which can be one or more new PRs.

milancurcic

Good to go. We can build with cmake from the top-level directory for now and figure out later how the test data can best be kept accessible from the executables.

certik · 2019-12-21T22:11:31Z

@jvdp1 it's merged! Go ahead and submit PRs against master now (into an experimental module).

certik · 2019-12-21T22:39:01Z

@milancurcic what you wrote in #23 (comment) is, I think, exactly how we should do it. Merging to experimental modules can be treated similarly to merging to any other open-source project --- the community must agree on the API, tests must pass, and it must pass review. It doesn't have to be 100% ready, as in this PR --- but we must be able to get to the 100% solution by sending subsequent PRs.

Once we get to the 100% solution in experimental, we'll have to figure out a workflow for moving it from experimental into main. For now our workflow is enough to keep going.

Thanks for the review!

certik added 3 commits December 19, 2019 11:20

Move Makefile -> Makefile.manual

65d8d59

So that it does not conflict with CMake generated makefiles. To use them, execute make with: make -f Makefile.manual

Implement loadtxt and savetxt

7a7ca5f

This also includes a minimal CMake build system. We can improve the build system in further PRs. Fixes fortran-lang#16.

Add tests for loadtxt and savetxt

9d0d3aa

certik mentioned this pull request Dec 19, 2019

loadtxt and savetxt #16

Closed

milancurcic reviewed Dec 19, 2019

certik commented Dec 19, 2019

milancurcic reviewed Dec 19, 2019

jvdp1 reviewed Dec 19, 2019

certik mentioned this pull request Dec 20, 2019

How should stdlib handle single, double, quadruple precision types #25

Closed

certik added 4 commits December 20, 2019 16:24

Update src/stdlib_io.f90

8d33ead

Use :: after public

43ed837

Remove stdlib_types, use iso_fortran_env instead

eff8a6f

Move the code to stdlib_experimental

559bfd7

certik added 3 commits December 20, 2019 22:04

Implement single precision version

5e9565e

Refactor the test

57d517f

Add a test for single precision

65301b9

jvdp1 reviewed Dec 21, 2019

certik mentioned this pull request Dec 21, 2019

Standard library proposals j3-fortran/fortran_proposals#104

Open

milancurcic approved these changes Dec 21, 2019

certik merged commit bee64c5 into fortran-lang:master Dec 21, 2019

certik deleted the loadtxt branch December 21, 2019 22:11

certik mentioned this pull request Dec 30, 2019

improve cmake build #51

Merged

jvdp1 pushed a commit to jvdp1/stdlib that referenced this pull request Oct 2, 2021

Merge pull request fortran-lang#23 from fortran-lang/master

ad180de

