Skip to content

Latest commit

 

History

History
19 lines (13 loc) · 363 Bytes

README.md

File metadata and controls

19 lines (13 loc) · 363 Bytes

Raw-Py150 dataset for Code Completion task

Step 1: download raw-py150 dataset

bash dataset/raw_py150/download.sh

Step 2: flatten attributes of raw-py150 files

python -m dataset.raw_py150.attributes_cast

Step 3: preprocess raw-py150 files into code tokens

python -m dataset.raw_py150.completion.preprocess