Make solr field/schema resolvers respect the tokenized field of attributes by clockard · Pull Request #7003 · codice/ddf

clockard · 2026-06-02T19:46:54Z

What does this PR do?

Updates the solr resolvers to respect the tokenized field of attribute definitions instead of assuming all text fields are tokenized.
Also makes metacard-tags tokenized.

One question I still have is why wildcard id queries don't work in the testing framework. By default the solr schema does have id defined as tokenized but that's not how the attribute is defined. It should work either way but it wasn't in the tests which is why I had to switch them to anyText filters.

Who is reviewing it?

@jaymcnallie
@jrnorth

Any background context you want to provide?

The default ddf solr schema uses primarily dynamic field definitions. These definitions determine how solr stores/handles the data and completely disregards the MetacardType definitions of attributes when it comes to fields like indexed and tokenized. If a downstream project wants to define their fields explicitly for better data handling the resolvers assumption of all string fields being tokenized breaks down and causes issues.

Notes on Review Process

Please see Notes on Review Process for further guidance on requirements for merging and abbreviated reviews.

Review Comment Legend:

✏️ (Pencil) This comment is a nitpick or style suggestion, no action required for approval. This comment should provide a suggestion either as an in line code snippet or a gist.
❓ (Question Mark) This comment is to gain a clearer understanding of design or code choices, clarification is required but action may not be necessary for approval.
❗ (Exclamation Mark) This comment is critical and requires clarification or action before approval.

CLAassistant · 2026-06-02T19:47:01Z

Thank you for your submission! We really appreciate it. Like many open source projects, we ask that you sign our Contributor License Agreement before we can accept your contribution.

chris.lockard seems not to be a GitHub user. You need a GitHub account to be able to sign the CLA. If you have already a GitHub account, please add the email address used for this commit to your account.
_{You have signed the CLA already but the status is still pending? Let us recheck it.}

jaymcnallie

Trying to figure out why the unit tests lit up red on the PR build. I think you introduced a query side regression by selectively writing the tokenized copy of the attribute data.

Queries also need to respect the tokenized/not-tokenized field behavior when they run. As it stands, fuzzy searches will look in tokenized fields for tokenized=false attributes and never get results because those fields are now empty.

…butes

jrnorth · 2026-06-05T18:15:43Z

+      return;
+    }
+
+    if (!descriptor.isTokenized()) {


I am not 100% confident that I know what all of these fields are for, but it looks like there are at least two cases where fields were added to the cache that will no longer be hit:

_phonetics for untokenized string fields when phonetics is enabled.
a. I don't know enough about this feature to know what this would impact.

_tpt for untokenized XML fields.
a. This one I am really not sure about because I don't see any references to _tpt in our schema.

jrnorth · 2026-06-05T18:22:07Z

      if (!isExact) {
-        field += tokenized;
+        field = getMappedPropertyName(propertyName, AttributeFormat.STRING, false);
      }


✏️ We can remove this case and pass isExact as the last argument above.

jrnorth · 2026-06-05T18:33:58Z

+  @Test
+  public void testGetFieldNumericalFallsBackToRequestedSuffix() {
+    // When no numerical field exists in cache, should fall back to the requested suffix
+    // fieldsCache is empty for a new resolver


✏️ It will have a minimum of three items, but yes it will not have anything for these fields.

jrnorth · 2026-06-05T19:23:27Z

    SchemaField schema = new SchemaField("testField_int", "tint");
-    schema.setSuffix("_int");
-    when(mockResolver.getSchemaField("testField", true)).thenReturn(schema);
+    schema.setSpecialSuffix("_int");


✏️ The special suffix is only used for the like visitor, so it can be removed in these other tests that set it.

clockard requested a review from jaymcnallie June 2, 2026 19:46

jrnorth reviewed Jun 2, 2026

View reviewed changes

jaymcnallie requested changes Jun 2, 2026

View reviewed changes

jaymcnallie requested changes Jun 4, 2026

View reviewed changes

Comment thread platform/solr/solr-query/src/main/java/org/codice/solr/query/SchemaFieldResolver.java Outdated

clockard force-pushed the fix-solr-field-reslover branch from d2c89f0 to 2981850 Compare June 4, 2026 16:40

jaymcnallie approved these changes Jun 4, 2026

View reviewed changes

chris.lockard added 2 commits June 5, 2026 09:09

Make solr field/schema resolvers respect the tokenized field of attri…

0767441

…butes

Fix tests

41123f6

clockard force-pushed the fix-solr-field-reslover branch from a6cdfb8 to 41123f6 Compare June 5, 2026 16:09

jrnorth approved these changes Jun 5, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Make solr field/schema resolvers respect the tokenized field of attributes#7003

Make solr field/schema resolvers respect the tokenized field of attributes#7003
clockard wants to merge 2 commits into
masterfrom
fix-solr-field-reslover

clockard commented Jun 2, 2026 •

edited

Loading

Uh oh!

CLAassistant commented Jun 2, 2026

Uh oh!

Uh oh!

Uh oh!

Uh oh!

jaymcnallie left a comment

Uh oh!

Uh oh!

jrnorth Jun 5, 2026

Uh oh!

jrnorth Jun 5, 2026

Uh oh!

jrnorth Jun 5, 2026

Uh oh!

jrnorth Jun 5, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Conversation

clockard commented Jun 2, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

What does this PR do?

Who is reviewing it?

Any background context you want to provide?

Notes on Review Process

Review Comment Legend:

Uh oh!

CLAassistant commented Jun 2, 2026

Uh oh!

Uh oh!

Uh oh!

Uh oh!

jaymcnallie left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

jrnorth Jun 5, 2026

Choose a reason for hiding this comment

Uh oh!

jrnorth Jun 5, 2026

Choose a reason for hiding this comment

Uh oh!

jrnorth Jun 5, 2026

Choose a reason for hiding this comment

Uh oh!

jrnorth Jun 5, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

clockard commented Jun 2, 2026 •

edited

Loading