Incorporate reranking for improved dataset retrieval and automatic dataset transformation for reduced manual effort
What's Changed
The main changes are:
- Dataset retrieval is significantly improved by using an LLM as a reranker. This should return better datasets with less manual configuration (e.g. no need to manually specify which of the top-N datasets and configs to choose from).
- Automatic dataset transformation is now available, so you can get more out of a retrieved dataset with less manual effort.
- Dataset Reranking by @ritugala in #383
- Add data transformation capability to dataset retrieval step by @saum7800 in #385
- Improving Transform and Rerank Module by @ritugala in #396
- Migrate to the latest openAI version by @saum7800 in #397
- Add mistral fine-tuning and examples by @saum7800 in #395
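To give a rough idea of what "using an LLM as a reranker" can look like, here is a minimal sketch. It is not the prompt2model implementation: `build_rerank_prompt`, `rerank`, the candidate dictionary keys, and the `ask_llm` callable are all hypothetical names chosen for illustration, and the prompt wording is an assumption.

```python
import re
from typing import Callable, Dict, Optional, Sequence


def build_rerank_prompt(task: str, candidates: Sequence[Dict[str, str]]) -> str:
    """Format the task and the top-N retrieved datasets as a numbered list."""
    lines = [f"Task: {task}", "Candidate datasets:"]
    for number, candidate in enumerate(candidates, start=1):
        lines.append(f"{number}. {candidate['name']}: {candidate['description']}")
    lines.append("Reply with the number of the most relevant dataset.")
    return "\n".join(lines)


def rerank(
    task: str,
    candidates: Sequence[Dict[str, str]],
    ask_llm: Callable[[str], str],
) -> Optional[Dict[str, str]]:
    """Ask the model to pick one candidate; return None if the reply is unusable.

    `ask_llm` is a hypothetical stand-in for whatever LLM client you use:
    it takes a prompt string and returns the model's text reply.
    """
    reply = ask_llm(build_rerank_prompt(task, candidates))
    match = re.search(r"\d+", reply)  # first integer in the reply, if any
    if match is None:
        return None
    index = int(match.group()) - 1  # the prompt numbers candidates from 1
    if 0 <= index < len(candidates):
        return candidates[index]
    return None
```

Passing the model call in as a plain function keeps the sketch independent of any particular client library, and returning `None` on an unparseable or out-of-range reply lets the caller fall back to the original retrieval order.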
Full Changelog: v0.0.9...v0.1.0