APIv4 - Return id_field as part of Entity.get #20457

colemanw · 2021-06-01T02:40:38Z

Overview

Improves APIv4 metadata so it returns the name of the unique identifier field for each entity (usually but not always named "id").

Technical Details

All entities have a unique identifier field. It is usually named 'id', but in some entities the field is named something else, e.g. Afform uses 'name' as the identifier.

This returns the name of the field as part of Entity.get, and it's also available directly from each API class e.g. Contact::getInfo().

civibot · 2021-06-01T02:40:40Z

(Standard links)

If this is your first pull-request for CiviCRM, please browse CONTRIBUTING.md for information about the development and testing processes.
If you are reviewing this pull-request, you may wish to consult the test sites and the Review Standards (long template, short template).

colemanw · 2021-06-01T02:41:23Z

Civi/Api4/Generic/BasicEntity.php

@@ -132,8 +132,17 @@ public static function delete($checkPermissions = TRUE) {
   * @return BasicReplaceAction
   */
  public static function replace($checkPermissions = TRUE) {
-    return (new BasicReplaceAction(static::getEntityName(), __FUNCTION__))
+    return (new BasicReplaceAction(static::getEntityName(), __FUNCTION__, static::$idField))


This was a bug. Tests caught it when I changed the name of the id_field for this mock entity.

colemanw · 2021-06-01T02:46:06Z

@totten with this PR you can get the id_field for any API entity directly from the API class. If you don't know the name of the class, CoreUtil can look it up for you.

CoreUtil::getApiClass('Contact')::getInfo()['id_field']; // returns 'id'
CoreUtil::getApiClass('Afform')::getInfo()['id_field']; // returns 'name'

eileenmcnaughton · 2021-06-01T02:58:00Z

@colemanw I don't suppose this PR makes it better or worse but I do have some concerns about caching for the Entity::get data - it doesn't seem to be cached?

colemanw · 2021-06-01T03:36:04Z

> @colemanw I don't suppose this PR makes it better or worse but I do have some concerns about caching for the Entity::get data - it doesn't seem to be cached?

No, this PR doesn't affect that.
You're right, it isn't cached anywhere. Maybe it would be good to do so, or maybe it would be a waste of memory since it really isn't used for much. Afaik it's only used by the API Explorer and the Search Kit admin screen.

eileenmcnaughton · 2021-06-01T03:39:48Z

@colemanw we added a call for it to the monolog extension because sometimes logging is called before the entity is available & it fatals - but it seems kinda expensive tbh

totten · 2021-06-01T06:33:38Z

FWIW, caching was also my main concern about getInfo().

Granted, CustomValue::getInfo() is fairly light - but in all other variants, you have things like:

- Parse doc blocks (AbstractEntity::getInfo())
- Loop through all DAO fields and highlight FKs (EntityBridge::getInfo())
- Loop through all traits and cleanup names (AbstractEntity::getInfo())

Additionally, note that - even with its fairly conservative usage right now - there are things like getBAOFromApiName($entityName) which rely on getInfo(). As a consumer, I would expect that getBAOFromApiName() would be pretty light (ie it wouldn't need to parse docblocks+fields to figure that out).

There is a parallel issue around caching the list of fields, although the status-quo is a bit different. It's already cached... Each Action instance has a method $this->entityFields() with a copy of the field list. This is useful for, say, looping through fields and applying validation-rules. But if the validation work is being done by a subscriber, then it can't access $apiRequest->entityFields() because it's protected.

To my eye, the entity-info and the entity-fields are very similar metadata and should follow the same lifecycle / coding-style / visibility.

I guess the first question about caching is more like... how long should the cache live?

(a) Cache should live for many PHP requests (e.g. Civi::cache('long') or Civi::cache('fields'))
(b) Cache should live for many PHP requests, but only if it's memcache/redis (e.g. Civi::cache('short'))
(c) Cache should live for the duration of one PHP request (e.g. Civi::$static)
(d) Cache should live for the duration of one API call (e.g. $apiRequest->entityFields())
(e) Cache should be impromptu, driven by each consumer of the data
(f) Metadata should not be cached

Personally, my first pick would be (c) per-request/Civi::$static, but really anything from (a)-(d) seems OK as long as:

- the data is accessible from listeners/subscribers
- we can change our mind (ie there's a clear spot where that policy has been implemented)
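
Option (c) could be sketched roughly as below. This is a minimal, self-contained illustration and not CiviCRM code: `EntityInfoCache`, `computeInfo()` and `$computeCount` are hypothetical names, and a plain static array stands in for Civi::$static. The `flush()` method marks the one spot where the caching policy could later be changed.

```php
<?php
// Hypothetical sketch of option (c): memoize entity info for one PHP request.
// A plain static array stands in for Civi::$static; none of these names are CiviCRM APIs.
final class EntityInfoCache {
  /** @var array<string, array> Info keyed by entity name. */
  private static $cache = [];
  /** @var int Number of times the expensive computation ran. */
  public static $computeCount = 0;

  public static function getInfo(string $entity): array {
    if (!isset(self::$cache[$entity])) {
      self::$cache[$entity] = self::computeInfo($entity);
    }
    return self::$cache[$entity];
  }

  // The single place where the cache is reset; change the policy here.
  public static function flush(): void {
    self::$cache = [];
  }

  // Stand-in for the expensive work (parsing docblocks, scanning traits and fields).
  private static function computeInfo(string $entity): array {
    self::$computeCount++;
    $idFields = ['Afform' => 'name'];
    return ['name' => $entity, 'id_field' => $idFields[$entity] ?? 'id'];
  }
}

// Five reads of the same entity trigger one computation.
for ($i = 0; $i < 5; $i++) {
  $info = EntityInfoCache::getInfo('Contact');
}
echo EntityInfoCache::$computeCount, "\n"; // prints 1
```

Because the accessor is public and static, a listener or subscriber could read the same data as the action that triggered it.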

eileenmcnaughton · 2021-06-01T07:03:14Z

I think the performance here degrades the more extensions you have installed? Which probably pushes me to a or b

Note that we use the Civi::cache('metadata') more than Civi::cache('fields') - I think the policy is the same but metadata is flushed in more places - whenever an entity likely to relate to metadata (eg. PriceSet, MembershipType, Custom Field) is changed

seamuslee001 · 2021-06-01T08:13:38Z

Broadly this looks sensible to me and I can see how it would help. I tend to agree with Eileen re caching options

colemanw · 2021-06-01T11:55:01Z

Tests are happy, let's merge this & do caching in a separate PR.

totten · 2021-06-01T21:26:32Z

> I think the performance here degrades the more extensions you have installed? Which probably pushes me to a or b

I don't think so... At the risk of being pedantic, my read on the trade-off (qty computations vs qty local-memory vs qty external IO calls) is that the typical resource-usage (Θ(...)) for a PHP request would behave as follows:

With (d) API-call-caching
- Computation: Θ(total #API calls) (if you call Foo.create 10x, then you compute Foo metadata 10x)
- Memory: Θ(max #nested API calls) (if you call Foo.replace which recurses to Foo.create, then you have 2x copies of Foo metadata)
- I/O: Θ(1)
With (c) per PHP-request static-caching
- Computation: Θ(#distinct activated API entities) (if you call Foo.create 10x, then you compute Foo 1x)
- Memory: Θ(#distinct activated API entities)
- I/O: Θ(1)
(With (b), it depends on environment, but either it behaves like (c) or (a).)
With (a) long-caching (and no prefetching or batching)
- Computation: Θ(1)
- Memory: Θ(#distinct activated API entities)
- I/O: Θ(#distinct activated API entities)
With (a) long-caching (and some kind of prefetching or batching)
- Computation: Θ(1)
- Memory: Θ(#entities in the entire system) (if you have extensions which cumulatively define 100 entities, then you load 100x metadata into memory)
- I/O: Θ(#entities in the entire system)
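
To make the computation counts for (d) and (c) concrete, here is a small worked example. The workload is made up for illustration (ten calls on a hypothetical entity Foo and two on Bar). It only counts how often metadata would be computed and does not call any CiviCRM code.

```php
<?php
// Hypothetical workload: entity names of the API calls made during one PHP request.
$calls = array_merge(array_fill(0, 10, 'Foo'), ['Bar', 'Bar']);

// (d) Per-API-call caching: every call computes its entity's metadata once.
$perCallComputations = count($calls);

// (c) Per-request caching: metadata is computed once per distinct entity.
$perRequestComputations = count(array_unique($calls));

printf("(d) %d computations, (c) %d computations\n", $perCallComputations, $perRequestComputations);
// prints "(d) 12 computations, (c) 2 computations"
```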

I suppose in some scenarios, the #distinct activated entities and #entities in the entire system would be similar (e.g. long-running batch-processor with heterogeneous tasks; e.g. building a power-tool akin to API Explorer). In that edge-case, maybe (a)+prefetching is technically a little better, but probably not noticeably better, and I don't think those edge-cases are representative.

totten · 2021-06-01T22:02:51Z

My gut agrees with you that we don't want Θ(#entities in the entire system). So probably not (a)+prefetch. (Though that could be wrong... it's just a gut sense...)

(c) and (a)+no-prefetch have the same formulas except they trade computations versus I/O. (Surprise!) IMHO, between those, it boils down to:

- How much you fear stale caches. (Longer caches mean greater impact from cache-bugs...)
- Whether you think 1x computation of getInfo()/entityFields() will be faster or slower than 1x external I/O for the same. (Gutguessgamblopinion: APCU-IO beats computation which beats SQL-IO, but Secretariat beats all of them.)

eileenmcnaughton · 2021-06-01T22:11:05Z

The io is something like 2 or 3 file look ups per entity per enabled extension or module isn't it?

totten · 2021-06-02T00:45:57Z

I'm not seeing that with respect to getInfo per se...

Ex: Suppose you run Contact.create. It needs metadata (defaults, idField, etc), so it calls getInfo()/entityFields(). It's true that this is influenced by 3x PHP files for the specific entity (CRM/Contact/DAO/Contact.php, CRM/Contact/BAO/Contact.php, Civi/Api4/Contact.php, etc). However, those 3x PHP files would be required anyway... because you can't read or write Contacts without all 3 files.

The reason why I think caching makes sense for getInfo() or entityFields() is that they involve additional work, above/beyond just requiring the PHP file. (To wit: scanning/filtering docblocks, traits, fields.) I think it's entirely reasonable to have 5 different callers that want to know the id_field, title, dao, etc -- but I don't think that each read of id_field should trigger a scan of docblocks+traits+fields.

civibot bot added the master label Jun 1, 2021

colemanw merged commit a076350 into civicrm:master Jun 1, 2021

colemanw deleted the id_field branch June 1, 2021 11:55
