Oracle Language Model #316

lawhead · 2024-02-20T23:41:23Z

Overview

Added a new language model that has knowledge of the target copy phrase and boosts the probability of the next target.

Ticket

https://www.pivotaltracker.com/story/show/187083845

Contributions

Added new language model and tests

Test

Added unit tests
Ran Copy Phrase task using the Oracle LM

dcgaines

For the most part this looks good and pretty straightforward, just a couple of suggested changes/questions.

It may be beneficial to add a method to update the target text, so we don't have to reinitialize the LM for every task. Probably not a huge deal since we aren't actually loading a model
There should be some protections around the valid value range for target_bump (non-negative, some sort of max?).
Do we need to normalize the distribution in predict() before returning?
For the other LMs, backspace is not included in the distribution. The predict() method returns a probability of 0, and then it is set externally based on the parameters file. Can you run a test and check the session.json to make sure the oracle backspace isn't being overwritten by the static backspace_prob?

lawhead · 2024-02-21T19:52:20Z

@dcgaines, thanks for the feedback!

Good thinking on the target_bump range checks. I added those, as well as some setter validation to ensure that the task_text does not get a None value. I also added some unit tests to demonstrate how to update the target text.

The normalization happens in the with_min_prob function when the target is bumped.

The other LM that returns a probability for backspace is the UniformLanguageModel. The CopyPhraseWrapper will only replace the value for the backspace if the LM did not return one or if the configured value for the min backspace probability is larger than what was returned by the LM. Currently that min value is set to 0.0. I checked the session.json from my Copy Phrase session using the OracleLanguageModel and confirmed that the value provided by the LM was not overridden.

dcgaines · 2024-02-22T17:13:04Z

Thanks for those changes and checks. Another thing I just thought of is that the model should have some handling for if target_text == evidence. Theoretically, this shouldn't happen, since the phrase will be done, but it might be good to handle just in case, and it might be as simple as returning None from the next_target() function, so that nothing gets bumped and there is a uniform prob.

lawhead · 2024-02-22T17:26:15Z

Oh yes, good catch. I'll add some handling for that.

…eds the task_text length

dcgaines

Looks good, thanks!

#187083845 ; added Oracle Language Model

978c4ad

lawhead changed the base branch from main to 2.0.0rc4 February 20, 2024 23:41

lawhead requested a review from dcgaines February 20, 2024 23:42

dcgaines requested changes Feb 21, 2024

View reviewed changes

Added checks for valid property values; updated changelog

34a5e75

Updated OracleLanguageModel to handle the case when the evidence exce…

21aedb4

…eds the task_text length

dcgaines approved these changes Feb 23, 2024

View reviewed changes

lawhead merged commit efc21bb into 2.0.0rc4 Feb 23, 2024
6 checks passed

lawhead deleted the oracle-lm branch February 23, 2024 16:52

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Oracle Language Model #316

Oracle Language Model #316

lawhead commented Feb 20, 2024

dcgaines left a comment

lawhead commented Feb 21, 2024

dcgaines commented Feb 22, 2024

lawhead commented Feb 22, 2024

dcgaines left a comment

Oracle Language Model #316

Oracle Language Model #316

Conversation

lawhead commented Feb 20, 2024

Overview

Ticket

Contributions

Test

dcgaines left a comment

Choose a reason for hiding this comment

lawhead commented Feb 21, 2024

dcgaines commented Feb 22, 2024

lawhead commented Feb 22, 2024

dcgaines left a comment

Choose a reason for hiding this comment