Add new data encapsulation #65

LSchueler · 2020-01-15T18:10:31Z

2 new classes are added in order to separate data and methods more
cleanly. This is very much WIP, but I'd like to get a discussion
started before I put in too much work.
The FieldData class contains information about the mesh and it
holds a dict containing the individual field values, which are
defined on that mesh, e.g. "krige" and "cond".

I'd like to more clearly differentiate between a class instance of Data and the actual field values contained in that instance, i.e. the numpy array. Any ideas?
For a better discussion, I used several different ways to access both, like

field["srf"]
field.values
field.field

Questions

Should the class Field be renamed to FieldMethods?
If a field is overwritten, with, say a new mesh_type or new pos, should only the field values be deleted or everything, including mean, ...?
What should the __call__ methods return?
- the dictionary containing the field names and the Data instances
- the Data instance of the default field
- the values of the default field
Should the default_field in the Data class always be called "field", or should the name be more descriptive, like "srf" or "krige"?

TODO

cleanup
overload the __setattr__ method in order to assign fields to each other
create a better separation between the Data instances and the actual field values
check for attributes being set to one of the Field children, which now belong to Data
documentation
get all the tests to run

2 new classes are added in order to separate data and methods more cleanly. This is very much WIP, but I'd like to get a discussion started before I put in too much work. The `FieldData` class contains information about the mesh and it holds a dict containing the individual field values, which are defined on that mesh, e.g. "krige" and "cond".

This way, we do not need a temporary field to store the mean value.

MuellerSeb · 2020-01-26T17:24:37Z

I love the approach of being able to access data by using the index operator.
To be backward-compatible we should provide the "field" attribute, which should point to a "default-field". The rest can change.

One issue I have with the approach is, that it is almost a re-implementation of a mesh-container like in meshio:
https://github.com/nschloe/meshio/blob/792dfdd6296a8858e365a005a87afa605834ce66/meshio/_mesh.py#L9

In pyevtk, the export routine "pointsToVTK" also creates a mesh, where the "cells" are simple vertices:
https://github.com/paulo-herrera/PyEVTK/blob/41d7e14e9b92909088b3e789632b1fd518b60d22/src/hl.py#L221

So in general, I would follow the definition of a VTK file, where we have:

points (pos in our case),
cells (vertices for unstructured, hypercubes for structured),
point-data (scalar or vector [or tensor in VTK]) -> we are interested in this
field-data (maybe here we can store sth like the default field name)
cell-data (i dont think we need this)
all *-data as dict{str: np.ndarray / single value}

The Data could also be dropped. Since the value_type could be determined from the data shape and the mesh_type. Also the mean should only be connected with the current Field class.
For the class-names I would propose:

Data - drop
FieldData -> Mesh (with point_data and field_data {"default_field": "field"})
Field -> GSField (since it's the main actor in "GS"Tools) with:
- "mean" as a field_data entry in the Mesh and a mean attribute pointing there
- field attribute pointing to the point_data with the name of the default_field (from field_data)
value_type -> function determining the value_type of a numpy array (taking mesh_type and pos.shape)
within the current Field.mesh method, we could then also overwrite the underlying mesh with the given one.
in every call routine, we could leave the "pos" parameter optional to use the already present underlying mesh if nothing was given
if a new pos vector is given, the underlying point_data is erased

Then we could easy export meshes with meshio (at least in the unstructured case). Or we could totally move to pyvista and use it for IO and plotting, since there is no option to export a structured. (or we could just covert it to unstructured when exporting)

I guess @banesullivan would be happy about that, since we come closer to the VTK representation of a mesh.

These were my two cents ;-)

MuellerSeb

See the other comment.

MuellerSeb · 2020-01-26T17:53:36Z

gstools/field/base.py



-class Field:
+class Data:


This can be dropped. (See comment)

MuellerSeb · 2020-01-26T17:53:55Z

gstools/field/base.py

+        self.value_type = value_type
+
+
+class FieldData:


This could be called a Mesh class.

MuellerSeb · 2020-01-26T17:54:28Z

gstools/field/base.py

+            raise ValueError("Unknown 'mesh_type': {}".format(mesh_type))
+
+
+class Field(FieldData):


This could be the GSField class, since it is our working horse.

LSchueler · 2020-02-07T13:50:30Z

Thanks for your pretty thought out input!

I dig the name Mesh.

Are you suggesting to use nested dicts for storing something like the variance of the kriged field, like

m = Mesh()
m.add_field("krige_field", ...)
m.field_data["krige_field"]["var"] = 3.14

with the key of the field_data being the same as the name of the point_data? The alternative would be to keep the extra Data class, which we could of course rename to FieldData.

LSchueler · 2020-02-07T14:12:28Z

As we don't have many variables belonging to auxiliary fields, we should simply add them to the Mesh class like var_krige.

MuellerSeb · 2020-04-28T11:19:50Z

When a field is called with a new pos tuple, the mesh should be resetted.
For unstructured meshes, an append option could be interesting, for situations, where you want to iteratively generate new points without resetting the mesh.

LSchueler · 2020-04-29T16:23:40Z

I've changed the SRF-class and especially the call method to being more mesh-centric. In this concept, it doesn't make a whole lot of sense to give the position-tuple everytime an SRF is to be calculated. The pos-tuple belongs to the mesh.

To not break the backwards compatibility, I've included warnings.

LSchueler · 2020-04-29T16:25:35Z

@MuellerSeb Please explain to me how TestKrige.test_extdrift works! You are using a structured grid, but you are generating srf's on unstructured grids. That's the only test, which does not work on this branch.

LSchueler · 2020-04-29T16:28:32Z

The way the conditioning functions get the raw_field is pretty ugly at the moment. But they are so tightly coupled to the SRF class, would it make sense to rework these functions and bring them into the SRF class?

MuellerSeb · 2020-04-29T17:13:07Z

@MuellerSeb Please explain to me how TestKrige.test_extdrift works! You are using a structured grid, but you are generating srf's on unstructured grids. That's the only test, which does not work on this branch.

I'll have a look at it.

MuellerSeb · 2020-04-29T17:38:25Z

@LSchueler : Found the problem. it is in the Mesh._check_point_data routine. The input pos tuple does not consist of flat arrays in this example. The Mesh class doesn't include any conversion of the pos-tuple like it was done before in the SRF class.

In the specific case, the x component of pos has a shape of (51, 61) and the field has a shape of 3111 (=51*61). in the checking routine this shape is compared to only the first entry of the shape of x (51) and there the error is thrown, that there is a shape missmatch.

I think, the pos tuple components should be flattened when the mesh is created (and converted to numpy arrays with floats).

MuellerSeb · 2020-04-29T18:39:15Z

Checklist:

add dim input and attribute to mesh class
pos tuple needs to be converted in __init__ (depending on dim as an array shaped (dim, pos_count) )
add axis input and attribute to mesh for sturctured meshes (pos then generated from axis on call)
add a convert method to convert structured to unstructured meshes
add a reset method to mesh, which takes the same arguments as init (init should call that)
any Field calls where pos is given should reset the unerlying mesh with the reset method (depending on the mesh_type, pos should be interpreted as pos or axis like it was before) )
SRF may always use the unstructured mesh internally (structured will be only used for variogram)... not sure about that one

LSchueler · 2020-04-30T10:26:00Z

Thanks for compiling our chat from yesterday!

MuellerSeb · 2020-07-21T22:28:29Z

Do we need another chat on that? Would be lovely to see this getting merged! :-)

LSchueler · 2020-07-29T13:37:38Z

We have created such spaghetti code...

MuellerSeb · 2020-07-30T12:51:47Z

We have created such spaghetti code...

Oh no... now I feel guilty!

MuellerSeb · 2020-08-18T11:24:18Z

Let's focus on GSTools v2 with this one.

MuellerSeb · 2020-11-13T13:55:52Z

Some new thoughts on this:

actually I am quite fine with the way it is now: SRF/Krige/Field hold the information for the last generated field (pos + values)
with the mesh class, we could provide a container living outside all these Field classes
when evaluating on a mesh class, we can use the already present method: Field.mesh (link
then we can provide simpler interfaces to meshio, ogs5py, pyvista, vtk, and so on
code refactoring is also easier, since we don't have to care about multiple fields stored somewhere buried deep beneath __get_items__ magic functions

LSchueler · 2021-01-26T11:26:36Z

I'll close this PR for now, as GSTools has meanwhile gone into a different direction.

LSchueler requested a review from MuellerSeb January 15, 2020 18:10

LSchueler self-assigned this Jan 15, 2020

LSchueler added enhancement New feature or request help wanted Extra attention is needed Refactoring Code-Refactoring needed here labels Jan 15, 2020

LSchueler mentioned this pull request Jan 15, 2020

Field.__call__ has inconsistent returns: use dictionaries #60

Closed

MuellerSeb mentioned this pull request Jan 16, 2020

Add PyVista mesh support to Field #59

Merged

LSchueler added 8 commits January 20, 2020 17:00

Blackened

0bff5de

Fix bug in setter method

89e76b4

[WIP] Add methods for data checking

bb67411

Move mean calc. after krige field is added

9e53dc9

This way, we do not need a temporary field to store the mean value.

Add todo note

32a64fa

[WIP] Move field base classes and add docstrings

e18138b

[WIP] 'mean' back to c'tor and refactor add_field

d5053ad

SRF __call__ is now using correct default field

0dcd4bf

LSchueler added this to the 1.2 milestone Jan 24, 2020

LSchueler added 7 commits January 24, 2020 11:03

Remove a getter which is already defined in parent

60491a3

Remove (now) wrong doc line

12357f4

Add pos setter and getter

dfc8a38

Amend to commit 60491a3

f0f341f

Add more getters, setters

4eee22b

Add reset fct. and checks to FieldData class

c75a45c

[WIP] updating SRF class

b6bbb94

MuellerSeb requested changes Jan 26, 2020

View reviewed changes

MuellerSeb mentioned this pull request Jan 28, 2020

PyVista next steps #32

Closed

3 tasks

[WIP] Add new Mesh class for discussion

fe257c8

LSchueler added 3 commits April 29, 2020 18:13

Add method for del. field_data

1bb20e9

1d pos is possible again without tuple

974ec02

Refactor SRF to be more mesh-centric

e9e9334

Add dim to Mesh arg. list

5a6a6fe

MuellerSeb mentioned this pull request May 1, 2020

Lat-Lon support (geographical coordinates) #54

Closed

MuellerSeb mentioned this pull request May 9, 2020

[Mesh] mesh element generation for unstructured meshes #91

Closed

[WIP] Try to get tests to run

344990f

MuellerSeb modified the milestones: 1.3, 2.0 Aug 18, 2020

MuellerSeb linked an issue Aug 18, 2020 that may be closed by this pull request

A new tutorial for Field.mesh() #93

Closed

LSchueler closed this Jan 26, 2021

MuellerSeb mentioned this pull request Jul 27, 2021

Control of field storage in Field class and subscript option #196

Closed

LSchueler deleted the variogram_update branch August 10, 2021 07:30

MuellerSeb removed this from the 2.0 milestone Jun 15, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add new data encapsulation #65

Add new data encapsulation #65

LSchueler commented Jan 15, 2020 •

edited

Loading

MuellerSeb commented Jan 26, 2020

MuellerSeb left a comment

MuellerSeb Jan 26, 2020

MuellerSeb Jan 26, 2020

MuellerSeb Jan 26, 2020

LSchueler commented Feb 7, 2020

LSchueler commented Feb 7, 2020

MuellerSeb commented Apr 28, 2020

LSchueler commented Apr 29, 2020

LSchueler commented Apr 29, 2020

LSchueler commented Apr 29, 2020

MuellerSeb commented Apr 29, 2020

MuellerSeb commented Apr 29, 2020

MuellerSeb commented Apr 29, 2020 •

edited by LSchueler

Loading

LSchueler commented Apr 30, 2020

MuellerSeb commented Jul 21, 2020

LSchueler commented Jul 29, 2020

MuellerSeb commented Jul 30, 2020

MuellerSeb commented Aug 18, 2020

MuellerSeb commented Nov 13, 2020

LSchueler commented Jan 26, 2021

		raise ValueError("Unknown 'mesh_type': {}".format(mesh_type))


		class Field(FieldData):

Add new data encapsulation #65

Add new data encapsulation #65

Conversation

LSchueler commented Jan 15, 2020 • edited Loading

Questions

TODO

MuellerSeb commented Jan 26, 2020

MuellerSeb left a comment

Choose a reason for hiding this comment

MuellerSeb Jan 26, 2020

Choose a reason for hiding this comment

MuellerSeb Jan 26, 2020

Choose a reason for hiding this comment

MuellerSeb Jan 26, 2020

Choose a reason for hiding this comment

LSchueler commented Feb 7, 2020

LSchueler commented Feb 7, 2020

MuellerSeb commented Apr 28, 2020

LSchueler commented Apr 29, 2020

LSchueler commented Apr 29, 2020

LSchueler commented Apr 29, 2020

MuellerSeb commented Apr 29, 2020

MuellerSeb commented Apr 29, 2020

MuellerSeb commented Apr 29, 2020 • edited by LSchueler Loading

Checklist:

LSchueler commented Apr 30, 2020

MuellerSeb commented Jul 21, 2020

LSchueler commented Jul 29, 2020

MuellerSeb commented Jul 30, 2020

MuellerSeb commented Aug 18, 2020

MuellerSeb commented Nov 13, 2020

LSchueler commented Jan 26, 2021

LSchueler commented Jan 15, 2020 •

edited

Loading

MuellerSeb commented Apr 29, 2020 •

edited by LSchueler

Loading