Frequency Map #22

Open
GoogleCodeExporter opened this issue Jul 16, 2015 · 1 comment

Comments

@GoogleCodeExporter

Good Afternoon,

How can I generate a frequency map of n-grams?

Thank you.

Original issue reported on code.google.com by [email protected] on 8 Dec 2014 at 4:48

@GoogleCodeExporter

It depends on what you mean. If you want to compute n-gram frequencies from raw
text, we don't support that. If you want an efficient in-memory representation
of n-gram counts that you have already put in Google n-grams format, then you
can build a StupidBackoffLm and get access to the underlying counts using:

https://code.google.com/p/berkeleylm/source/browse/trunk/src/edu/berkeley/nlp/lm/StupidBackoffLm.java#132

See here for an example:

https://code.google.com/p/berkeleylm/source/browse/trunk/src/edu/berkeley/nlp/lm/io/MakeNgramMapBinaryFromGoogle.java#40
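
For the raw-text case, which the library does not cover, counting can be done
separately. The following is only a minimal sketch in plain Java, assuming
whitespace tokenization and in-memory counting; the class NgramFrequency and
its count method are hypothetical names for illustration and are not part of
berkeleylm.

```java
import java.util.Arrays;
import java.util.HashMap;
import java.util.Map;

// Hypothetical helper for illustration; not part of berkeleylm.
final class NgramFrequency {

    // Counts every run of exactly n whitespace-separated tokens in text.
    // Assumption: splitting on whitespace is an adequate tokenizer.
    static Map<String, Integer> count(String text, int n) {
        if (n < 1) {
            throw new IllegalArgumentException("n must be >= 1");
        }
        Map<String, Integer> counts = new HashMap<>();
        String trimmed = text.trim();
        if (trimmed.isEmpty()) {
            return counts;
        }
        String[] tokens = trimmed.split("\\s+");
        // Slide a window of n tokens over the token array.
        for (int i = 0; i + n <= tokens.length; i++) {
            String ngram = String.join(" ", Arrays.copyOfRange(tokens, i, i + n));
            counts.merge(ngram, 1, Integer::sum);
        }
        return counts;
    }

    public static void main(String[] args) {
        Map<String, Integer> bigrams = count("the cat sat on the cat", 2);
        System.out.println(bigrams.get("the cat")); // prints 2
    }
}
```

Counts produced this way would still have to be written out in Google n-grams
format before they could be loaded as described above.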

Let me know if you need further help.

Original comment by [email protected] on 9 Dec 2014 at 1:35
