Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Chore: Add ability to retry ingestion up to 3 times #1089

Merged

Conversation

StrongMonkey
Copy link
Contributor

@StrongMonkey StrongMonkey commented Dec 31, 2024

Add ability to re-enqueue the file ingestion process up to 3 times when it fails.

#1071

@StrongMonkey StrongMonkey force-pushed the add-file-ingestion-retry branch from 35eb6fb to 5a229d1 Compare January 2, 2025 20:22
@StrongMonkey StrongMonkey force-pushed the add-file-ingestion-retry branch from 5a229d1 to 657be9d Compare January 2, 2025 20:50
@@ -258,7 +258,7 @@ func (k *Handler) Sync(req router.Request, _ router.Response) error {

invokeOpts := invoke.SystemTaskOptions{
CredentialContextIDs: []string{credentialContextID},
Timeout: 1 * time.Hour,
Timeout: 2 * time.Hour,
Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Increase the time since we want to increase the number of pages our scraper can scrape and sometimes it can take up more than 1 hour to finish.

@@ -61,7 +61,7 @@ type Config struct {
HelperModel string `usage:"The model used to generate names and descriptions" default:"gpt-4o-mini"`
AWSKMSKeyARN string `usage:"The ARN of the AWS KMS key to use for encrypting credential storage" env:"OBOT_AWS_KMS_KEY_ARN" name:"aws-kms-key-arn"`
EncryptionConfigFile string `usage:"The path to the encryption configuration file" default:"./encryption.yaml"`
KnowledgeSetIngestionLimit int `usage:"The maximum number of files to ingest into a knowledge set" default:"1000" env:"OBOT_KNOWLEDGESET_INGESTION_LIMIT" name:"knowledge-set-ingestion-limit"`
KnowledgeSetIngestionLimit int `usage:"The maximum number of files to ingest into a knowledge set" default:"3000" env:"OBOT_KNOWLEDGESET_INGESTION_LIMIT" name:"knowledge-set-ingestion-limit"`
Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Increasing the number of files one agent can ingest, since we should be able to ingest 3000 files per agent without any error.

Note: we haven't really systematically tested whether increasing the number of ingested file will deteriorate the retrival result. This is just the success rate of ingestion.

@StrongMonkey StrongMonkey merged commit ba4ccaf into obot-platform:main Jan 3, 2025
2 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants