feat: README
lukasellinger committed Oct 16, 2024
1 parent b144bd8 commit 2efe79f
Showing 3 changed files with 61 additions and 7 deletions.
63 changes: 58 additions & 5 deletions README.md
@@ -1,21 +1,47 @@
# Evaluating Factuality Word Definitions

## Table of Contents
- [Setup](#setup)
  - [Python Environment Setup](#1-python-environment-setup)
  - [Install Required Packages](#2-install-required-packages)
  - [Configure Project Settings](#3-configure-project-settings)
  - [Discourse Simplification Repository Setup](#4-discourse-simplification-repository-setup)
- [Usage](#usage)
- [Repository Structure](#repository-structure)
- [Data](#data)
  - [Download Evaluation & Dataset Data](#download-evaluation--dataset-data)
  - [Download Wiki Pages (Optional)](#download-wiki-pages-optional)
- [Authors and Acknowledgments](#authors-and-acknowledgments)

## Setup

To set up the project environment, follow these steps:

-1. Install the required Python packages and models by running:
+### 1. Python Environment Setup
Make sure you have Python 3.10 or above installed.
It's recommended to use a virtual environment to avoid conflicts with other projects.
You can create and activate a virtual environment by running:
```
python -m venv evaluating-word-definitions
source evaluating-word-definitions/bin/activate # On Windows use: evaluating-word-definitions\Scripts\activate
```
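
The Python 3.10 requirement above can be checked before creating the environment. The following is a minimal sketch and not part of the repository; the helper name `meets_minimum_version` is hypothetical.

```python
import sys

# Minimum version stated in this README.
MINIMUM_VERSION = (3, 10)


def meets_minimum_version(version_info=None, minimum=MINIMUM_VERSION):
    """Return True if the given (or running) interpreter is at least `minimum`."""
    if version_info is None:
        version_info = sys.version_info
    # Compare only (major, minor); tuple comparison is lexicographic.
    return tuple(version_info[:2]) >= tuple(minimum)


if __name__ == "__main__":
    found = f"{sys.version_info.major}.{sys.version_info.minor}"
    status = "OK" if meets_minimum_version() else "too old"
    print(f"Python {found}: {status}")
```

Running the file with the interpreter you plan to use for the virtual environment prints whether it satisfies the requirement.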

### 2. Install Required Packages
Once the virtual environment is activated, install the necessary dependencies:
```
pip install -r requirements.txt
python setup.py
```
-2. Setup config file and add necessary attributes:
+### 3. Configure Project Settings
To properly configure the project, set up the configuration file:
```
cp config.py.template config.py
-nano config.py
+nano config.py # Use any text editor to modify config.py
```
Make sure to add all required attributes to config.py, such as API keys, model paths, or dataset locations.

-3. Repository for Discourse Simplification
+### 4. Discourse Simplification Repository Setup
For the discourse simplification component, clone the required repository and check out the specific commit as follows:
```
cd ..
git clone git@github.com:Lambda-3/DiscourseSimplification.git
@@ -24,5 +50,32 @@ git checkout 5e7ac12
mvn clean install -DskipTests
```

-## Authors and acknowledgment
## Usage
TODO: show the pipeline object and the notebooks

## Repository Structure
TODO: add explanations of where to find everything

## Data
The evaluation data is too large to be stored in this Git repository.
However, downloading it is not required to run the pipeline.
If needed, you can manually download the data from the following link:
[Link to Data](https://drive.google.com/drive/folders/1Vj15MWmNMzld7odGNZ9h7B9qrn4OKehV?usp=drive_link)

### Download Evaluation & Dataset Data
To download the evaluation dataset, use gdown to fetch the data from Google Drive:
```bash
gdown --folder "https://drive.google.com/drive/folders/1E8Vi6zmTDldCdWHrWReVx5teMtt0cite?usp=drive_link" -O data
```
This will download all the necessary files to the data/ folder.

### Download Wiki Pages (Optional)
We also provide the Wiki Pages used in the FEVER task.
If you need them for your analysis, you can download the Wiki Pages using the following command:
```bash
gdown --folder "https://drive.google.com/drive/folders/1FUfz6101wAFPWUEyEMPhyHJegA3uHcZM?usp=drive_link" -O wiki-pages
```
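
The two downloads above can also be scripted. The following is a minimal sketch under stated assumptions: it uses gdown's Python function `download_folder` with the same folder URLs as the commands above, and the helper names `pending_downloads` and `download_missing` are hypothetical, not part of the repository. It skips any target folder that already exists and is non-empty.

```python
from pathlib import Path

# Folder URLs and target directories copied from the gdown commands above.
DOWNLOADS = {
    "data": "https://drive.google.com/drive/folders/1E8Vi6zmTDldCdWHrWReVx5teMtt0cite?usp=drive_link",
    "wiki-pages": "https://drive.google.com/drive/folders/1FUfz6101wAFPWUEyEMPhyHJegA3uHcZM?usp=drive_link",
}


def pending_downloads(project_dir, downloads=DOWNLOADS):
    """Return {target_dir: url} for targets that are missing or empty."""
    project_dir = Path(project_dir)
    pending = {}
    for name, url in downloads.items():
        target = project_dir / name
        if not target.is_dir() or not any(target.iterdir()):
            pending[str(target)] = url
    return pending


def download_missing(project_dir="."):
    """Download every pending folder with gdown (requires network access)."""
    import gdown  # imported lazily so pending_downloads works without gdown

    for target, url in pending_downloads(project_dir).items():
        gdown.download_folder(url=url, output=target, quiet=False)
```

Calling `download_missing()` from the project root fetches only the folders that are not yet present, so the optional Wiki Pages can be removed from `DOWNLOADS` if they are not needed.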

## Authors and Acknowledgments
Lukas Ellinger ([email protected])
3 changes: 2 additions & 1 deletion requirements.txt
@@ -21,4 +21,5 @@ openai~=1.35.10
wandb~=0.17.5
flask~=3.0.3
flask_sse~=1.0.0
-huggingface-hub~=0.24.5
+huggingface-hub~=0.24.5
+gdown~=5.2.0
2 changes: 1 addition & 1 deletion scripts/sqlite_related/create_wikipages_db.py
@@ -25,7 +25,7 @@
with FeverDocDB() as db:
    db.write(CREATE_DOCUMENTS)

-wiki_pages_dir = PROJECT_DIR.joinpath('data/wiki-pages')
+wiki_pages_dir = PROJECT_DIR.joinpath('wiki-pages')
wiki_pages = os.listdir(wiki_pages_dir)
reader = JSONLineReader()
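
Because the script now reads from `wiki-pages` at the project root instead of `data/wiki-pages`, a checkout that still uses the old layout would fail at `os.listdir`. The following is a minimal sketch of a guard that fails early with a clearer message; the function name `list_wiki_page_files` is hypothetical and not part of the repository.

```python
from pathlib import Path


def list_wiki_page_files(project_dir, dirname="wiki-pages"):
    """Return sorted file names in <project_dir>/<dirname>, failing early if absent."""
    wiki_pages_dir = Path(project_dir) / dirname
    if not wiki_pages_dir.is_dir():
        raise FileNotFoundError(
            f"{wiki_pages_dir} not found; download the Wiki Pages first "
            "(see the README section 'Download Wiki Pages')."
        )
    # Keep only regular files so stray subdirectories are ignored.
    return sorted(p.name for p in wiki_pages_dir.iterdir() if p.is_file())
```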
