Chore: Add ability to retry ingestion up to 3 times #1089
Signed-off-by: Daishan Peng <[email protected]>
@@ -258,7 +258,7 @@ func (k *Handler) Sync(req router.Request, _ router.Response) error {

 	invokeOpts := invoke.SystemTaskOptions{
 		CredentialContextIDs: []string{credentialContextID},
-		Timeout:              1 * time.Hour,
+		Timeout:              2 * time.Hour,
Increase the timeout, since we want to raise the number of pages our scraper can scrape, and a sync can sometimes take more than 1 hour to finish.
@@ -61,7 +61,7 @@ type Config struct {
 	HelperModel string `usage:"The model used to generate names and descriptions" default:"gpt-4o-mini"`
 	AWSKMSKeyARN string `usage:"The ARN of the AWS KMS key to use for encrypting credential storage" env:"OBOT_AWS_KMS_KEY_ARN" name:"aws-kms-key-arn"`
 	EncryptionConfigFile string `usage:"The path to the encryption configuration file" default:"./encryption.yaml"`
-	KnowledgeSetIngestionLimit int `usage:"The maximum number of files to ingest into a knowledge set" default:"1000" env:"OBOT_KNOWLEDGESET_INGESTION_LIMIT" name:"knowledge-set-ingestion-limit"`
+	KnowledgeSetIngestionLimit int `usage:"The maximum number of files to ingest into a knowledge set" default:"3000" env:"OBOT_KNOWLEDGESET_INGESTION_LIMIT" name:"knowledge-set-ingestion-limit"`
Increase the number of files one agent can ingest, since we should be able to ingest 3000 files per agent without any error.
Note: we haven't systematically tested whether increasing the number of ingested files degrades retrieval results. The 3000 figure reflects only the ingestion success rate.
Add ability to re-enqueue the file ingestion process up to 3 times when it fails.
#1071
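
For reviewers, the re-enqueue behavior described above could look roughly like the sketch below. It is not the code in this PR: `fileStatus`, `handle`, `run`, and the state strings are hypothetical names. It also assumes "up to 3 times" means three retries after the initial attempt, so four attempts in total.

```go
package main

import (
	"errors"
	"fmt"
)

// maxIngestionRetries mirrors the "up to 3 times" limit in the PR description.
// Assumption: the limit counts retries, not the initial attempt.
const maxIngestionRetries = 3

// errIngest is a placeholder failure used by the demo below.
var errIngest = errors.New("ingestion failed")

// fileStatus is a hypothetical stand-in for the per-file status a handler would persist.
type fileStatus struct {
	RetryCount int
	State      string // "pending", "ingested", or "failed"
	LastError  string
}

// handle performs one ingestion attempt and reports whether the file
// should be re-enqueued for another attempt.
func handle(st *fileStatus, ingest func() error) bool {
	err := ingest()
	if err == nil {
		st.State = "ingested"
		st.LastError = ""
		return false
	}
	st.LastError = err.Error()
	if st.RetryCount < maxIngestionRetries {
		// Retry budget remains: record the retry and ask to be re-enqueued.
		st.RetryCount++
		st.State = "pending"
		return true
	}
	// Retry budget exhausted: give up and surface the failure.
	st.State = "failed"
	return false
}

// run simulates the queue by calling handle until it stops asking for a
// re-enqueue, and returns the total number of attempts made.
func run(st *fileStatus, ingest func() error) int {
	attempts := 1
	for handle(st, ingest) {
		attempts++
	}
	return attempts
}

func main() {
	// A flaky ingest that fails twice and then succeeds.
	calls := 0
	flaky := func() error {
		calls++
		if calls <= 2 {
			return errIngest
		}
		return nil
	}
	st := &fileStatus{State: "pending"}
	attempts := run(st, flaky)
	fmt.Println(attempts, st.State, st.RetryCount)
}
```

Keeping the retry count on the file's status rather than in memory means the limit survives a controller restart. Whether this PR stores it that way is not stated here.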