Orchard corrupts Elastic stemmer configuration #16384

Lenar-Avia · 2024-06-28T13:03:26Z

Describe the bug

Custom Elasticsearch analyzer configuration from the OrchardCore_Elasticsearch block gets corrupted.

Orchard Core version

1.8.2

To Reproduce

Steps to reproduce the behavior:

1. Go to the 'OrchardCore_Elasticsearch' section of the configuration.
2. Try to set up a simple stemmer analyzer, Spanish for example, as explained at:
   https://www.elastic.co/guide/en/elasticsearch/reference/current/analysis-lang-analyzer.html#spanish-analyzer
3. Make sure you have both "analyzer" and "filter" sections inside the "analysis" block.
4. Save and apply the configuration.
5. Check the current Elasticsearch settings with GET .../_settings: there is no "analysis.filter" section, i.e. the actual settings no longer contain a "filter" section (at the same level as "analyzer").
6. Check the termvectors to see that no stemming was applied to the terms.
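
The check in the last steps can be scripted. The sketch below is an illustration, not part of the original report: it assumes the `GET .../_settings` response has already been parsed into a dict with the usual `{index: {"settings": {"index": {...}}}}` shape. The helper name `missing_analysis_sections` and the index name `my_index` are hypothetical.

```python
def missing_analysis_sections(settings_response, index_name,
                              expected=("analyzer", "filter")):
    """Return the expected analysis sections that are absent from an index's settings."""
    index_settings = (
        settings_response.get(index_name, {})
        .get("settings", {})
        .get("index", {})
    )
    analysis = index_settings.get("analysis", {})
    return [section for section in expected if section not in analysis]


# Sample response shaped like the behaviour described in the steps:
# the analyzer survived, but the "filter" section was dropped.
response = {
    "my_index": {
        "settings": {
            "index": {
                "analysis": {
                    "analyzer": {
                        "default": {
                            "tokenizer": "standard",
                            "filter": ["lowercase", "spanish_stop", "spanish_stemmer"],
                        }
                    }
                }
            }
        }
    }
}

print(missing_analysis_sections(response, "my_index"))
```

For the sample response this reports `filter` as the only missing section, which matches what the steps describe.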

Expected behavior

I would expect the termvectors to be created using the stemmer.
When the config is set directly using PUT ../_settings, a custom morphological analyzer can be applied.
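
As a rough sketch of that direct route, the snippet below builds (but does not send) the three requests involved. It assumes a local cluster at `http://localhost:9200` and a placeholder index name `my_index`, neither of which comes from the report, and it relies on the usual Elasticsearch rule that analysis settings on an existing index are changed while the index is closed. `build_requests` is a hypothetical helper.

```python
import json
import urllib.request

ES_URL = "http://localhost:9200"  # assumption: local cluster, no authentication
INDEX = "my_index"                # assumption: placeholder index name

# Analysis settings taken from the configuration in this report.
ANALYSIS = {
    "filter": {
        "spanish_stop": {"type": "stop", "stopwords": "_spanish_"},
        "spanish_stemmer": {"type": "stemmer", "language": "light_spanish"},
    },
    "analyzer": {
        "default": {
            "tokenizer": "standard",
            "filter": ["lowercase", "spanish_stop", "spanish_stemmer"],
        }
    },
}


def build_requests(base_url, index, analysis):
    """Build, without sending, the close / update-settings / open requests."""
    def request(method, path, body=None):
        data = json.dumps(body).encode("utf-8") if body is not None else None
        headers = {"Content-Type": "application/json"} if body is not None else {}
        return urllib.request.Request(
            f"{base_url}/{index}/{path}", data=data, headers=headers, method=method
        )

    return [
        request("POST", "_close"),                           # close the index first
        request("PUT", "_settings", {"analysis": analysis}), # apply the analysis block
        request("POST", "_open"),                            # reopen the index
    ]


requests_to_send = build_requests(ES_URL, INDEX, ANALYSIS)
for req in requests_to_send:
    print(req.get_method(), req.full_url)
```

Each request could then be sent with `urllib.request.urlopen(req)`; that step is left out so the sketch runs without a cluster.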

Logs and screenshots

Please try to reproduce the following settings using OrchardCore_Elasticsearch:

"analysis": {
  "filter": {
    "spanish_stop": {
      "type":       "stop",
      "stopwords":  "_spanish_"
    },
    "spanish_stemmer": {
      "type":       "stemmer",
      "language":   "light_spanish"
    }
  },
  "analyzer": {
    "default": {
      "tokenizer":  "standard",
      "filter": [
        "lowercase",
        "spanish_stop",
        "spanish_stemmer"
      ]
    }
  }
}

github-actions · 2024-06-28T13:03:52Z

Thank you for submitting your first issue, awesome! 🚀 We're thrilled to receive your input. If you haven't completed the template yet, please take a moment to do so. This ensures that we fully understand your feature request or bug report. A core team member will review your issue and get back to you.

If you like Orchard Core, please star our repo and join our community channels.

MikeAlhayek · 2024-06-28T22:13:40Z

@Lenar-Avia I am not sure I follow your steps. But in order to create the rebuilt_spanish analyzer from the page you referenced, your configuration should look like the following:

"OrchardCore_Elasticsearch": {
  // ...
  "Analyzers": {
    "rebuilt_spanish": {
      "tokenizer":  "standard",
      "filter": [
        "lowercase",
        "spanish_stop",
        "spanish_keywords",
        "spanish_stemmer"
      ]
    }
  }
}

Can you see if the above works for you? Here is a reference from our documentation

Lenar-Avia · 2024-07-02T14:36:41Z

Hello, dear!
Well, if I supply the configuration as you have provided it, without the Analysis.Filters section,
then an exception occurs when I try to rebuild the index.
Details: "index_not_found_exception", "no such index [index_name]".
Your configuration is incomplete without Filters.

Also, you ignore the fact that only the "default" analyzer works in the OrchardCMS config
(i.e. it cannot be called rebuilt_spanish).
Please try to get a working stemmer configuration before removing the bug tag.
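
To illustrate why the suggested configuration is incomplete, here is a minimal sketch that lists the filters an analyzer references but nobody defines. The helper `undefined_filters` is hypothetical, and `BUILTIN_FILTERS` is only a small, incomplete sample of Elasticsearch's built-in token filter names, included so that `lowercase` is not reported.

```python
# Assumption: a small, incomplete sample of built-in token filter names.
BUILTIN_FILTERS = {"lowercase", "uppercase", "asciifolding", "stop", "stemmer"}


def undefined_filters(analyzers, custom_filters, builtin=BUILTIN_FILTERS):
    """Return (analyzer, filter) pairs whose filter is neither custom-defined nor built-in."""
    missing = []
    for analyzer_name, analyzer in analyzers.items():
        for filter_name in analyzer.get("filter", []):
            if filter_name not in custom_filters and filter_name not in builtin:
                missing.append((analyzer_name, filter_name))
    return missing


# The analyzer from the suggested configuration, with no custom filters defined.
analyzers = {
    "rebuilt_spanish": {
        "tokenizer": "standard",
        "filter": ["lowercase", "spanish_stop", "spanish_keywords", "spanish_stemmer"],
    }
}

for analyzer_name, filter_name in undefined_filters(analyzers, custom_filters={}):
    print(f"{analyzer_name}: filter '{filter_name}' is not defined")
```

With no custom filters supplied, `spanish_stop`, `spanish_keywords` and `spanish_stemmer` are all reported, which is the gap described above.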

MikeAlhayek · 2024-07-02T18:31:15Z

Filters are not something we support in OC. Feel free to submit a PR that adds filter support in addition to the analyzers.

github-actions · 2024-07-02T18:33:57Z

We triaged this issue and set the milestone according to the priority we think is appropriate (see the docs on how we triage and prioritize issues).

This indicates when the core team may start working on it. However, if you'd like to contribute, we'd warmly welcome you to do that anytime. See our guide on contributions here.

Lenar-Avia added the bug 🐛 label Jun 28, 2024

Lenar-Avia changed the title ~~Orchard corrups Elastic stemmer configuration~~ Orchard corrupts Elastic stemmer configuration Jun 28, 2024

MikeAlhayek added needs author feedback and removed bug 🐛 labels Jun 29, 2024

This was referenced Jul 1, 2024

Monthly community metrics report for 2024-06-01..2024-06-30 SGuidone/OrchardCore#45

Open

Monthly community metrics report for 2024-06-01..2024-06-30 #16387

Closed

MikeAlhayek added enhancement and removed needs author feedback labels Jul 2, 2024

MikeAlhayek added this to the 2.x milestone Jul 2, 2024

denispetrische mentioned this issue Oct 7, 2024

Add support of Elasticsearch Token Filters #16843

Merged

hishamco closed this as completed in #16843 Oct 17, 2024

MikeAlhayek modified the milestones: 2.x, 2.1 Nov 19, 2024
