-
Notifications
You must be signed in to change notification settings - Fork 1.1k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
DataOutput.writeGroupVInts throws IntegerOverflow exception during merging #13373
Comments
Below code snippet is from 9_10 branch where this issues has been observed. As per the latest change for 10, we have moved few set of lines from below method to other class into a new method. lucene/lucene/core/src/java/org/apache/lucene/store/DataOutput.java Lines 337 to 356 in f12e489
|
Sorry for missing the email list, It seems the |
Hi @easyice, I am the original reporter on the mailing list. As the code around indexing is a bit abstracted it might be hard to follow. What I do have, is the index that failed merging it is however, 173 GB xz compressed. I could use luke or a tool like that to extract more information for the lucene team. The fieldtype that we are indexing into is UNSTORED_POSITIONAL.setOmitNorms(true);
UNSTORED_POSITIONAL.setIndexOptions(IndexOptions.DOCS_AND_FREQS_AND_POSITIONS);
UNSTORED_POSITIONAL.setStored(false);
UNSTORED_POSITIONAL.setTokenized(false);
UNSTORED_POSITIONAL.freeze();``` Then we add fields like so doc.add(new Field("type", value.toLowerCase(Locale.US), UNSTORED_POSITIONAL); With over 1,177,800,000 documents in this index, all with the term "positional" at least once in their documents. So to create local sample data I would just do ;) for (int i=0;i<2_000_000_000;i++){
{
Document doc = new Document();
doc.add(new Field("type", "number", UNSTORED_POSITIONAL);
if (i % 2 == 0} {
doc.add(new Field("type", "even", UNSTORED_POSITIONAL);
} else {
doc.add(new Field("type", "un-even", UNSTORED_POSITIONAL);
}
writer.addDocument(doc);
} |
Thank you @JervenBolleman , I have found the cause of the issue with @gf2121 , i will raise a PR later. |
Here is the Thank you for reporting this @iamsanjay! It looks like it was a real bug, phew, and somewhat serious (not sure). |
Description
As being discussed on email list that
DataOutput.writeGroupVInts
throws as IntegerOverflow exception. The goal is to find out the main reason and also to improve the exception message.More context from the reporter
Version and environment details
No response
The text was updated successfully, but these errors were encountered: