Off-Policy Learning to Rank Codebase

This codebase reproduces the results in the paper "Unified Off-Policy Learning to Rank: a Reinforcement Learning Perspective".

CUOLR: RL-based (SAC, CQL)
BASELINES: DLA, IPW, CM-IPW

Experiments

Prepare Click Data with SVM Ranker

Choose the data preprocessing script that matches your dataset: dataset/data_preprocess_{dataset-name}.sh, where dataset-name is one of istella_s, web10k, or yahoo. Change Data_path on the first line of that script to the root path of the dataset, then run:

bash ./dataset/data_preprocess_{dataset-name}.sh

Run Experiments

All experiments are in the exps folder. Each subfolder corresponds to one experiment in the paper. To run an experiment on a dataset, follow these steps:

1. Open exps/{exp-name}/{exp-data-name}/run.sh and change output_fold to exps/{exp-name}/{exp-data-name}.
2. Open exps/{exp-name}/{exp-data-name}/run_json/run_svm.json and change dataset_fold to the root path of the dataset.
3. Run bash ./exps/{exp-name}/{exp-data-name}/run.sh

where exp-name is one of ablation_alphas, ablation_embed, or baselines, and data-name is one of istella_s, web10k, or yahoo.
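The dataset_fold change in run_svm.json can also be scripted. The following is a minimal sketch, not part of the repository: the helper name set_dataset_fold is hypothetical, and it assumes dataset_fold is a top-level key of the JSON file, which should be checked against the actual config.

```python
import json
from pathlib import Path


def set_dataset_fold(config_path, dataset_root):
    """Set `dataset_fold` in a JSON config file and return the updated dict.

    Assumption: `dataset_fold` is a top-level key. If the real file nests it,
    adjust the assignment below accordingly.
    """
    path = Path(config_path)
    config = json.loads(path.read_text())
    config["dataset_fold"] = str(dataset_root)
    # Write the file back with all other keys preserved.
    path.write_text(json.dumps(config, indent=2) + "\n")
    return config
```

Calling set_dataset_fold("exps/{exp-name}/{exp-data-name}/run_json/run_svm.json", "/path/to/dataset") with the placeholders filled in would replace the manual edit of that file.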

Results and Analysis

Evaluation

Evaluation results on the test set are stored in exps/{exp-name}/{exp-data-name}/results/performance.txt. The reported metrics are ERR and NDCG at cutoffs 3, 5, and 10 ({err, ndcg}@{3,5,10}).
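For readers unfamiliar with the two metrics, the sketch below computes NDCG@k and ERR@k for a single ranked list using their common textbook definitions. It is an illustration only: the function names are hypothetical, the exponential gain and the maximum relevance grade of 4 are assumptions, and the repository's own evaluation code may differ in details such as tie handling or queries without relevant documents.

```python
import math


def dcg_at_k(labels, k):
    """DCG with exponential gain (2^rel - 1) and log2 position discount."""
    return sum((2 ** rel - 1) / math.log2(rank + 2)
               for rank, rel in enumerate(labels[:k]))


def ndcg_at_k(labels, k):
    """NDCG@k; `labels` are relevance grades listed in ranked order."""
    ideal = dcg_at_k(sorted(labels, reverse=True), k)
    return dcg_at_k(labels, k) / ideal if ideal > 0 else 0.0


def err_at_k(labels, k, max_grade=4):
    """Expected Reciprocal Rank truncated at k.

    Assumption: grades lie in 0..max_grade, with max_grade=4 by default.
    """
    err, p_continue = 0.0, 1.0
    for rank, rel in enumerate(labels[:k], start=1):
        p_stop = (2 ** rel - 1) / 2 ** max_grade  # chance the user stops here
        err += p_continue * p_stop / rank
        p_continue *= 1.0 - p_stop
    return err
```

A perfectly ordered list gives ndcg_at_k(labels, k) equal to 1.0, while ERR rewards placing a highly relevant document early because the probability of continuing shrinks after each relevant result.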

T-test

To run the t-test, follow these steps:

1. Change result_path and output_path in the main function of T_test/T_test_{exp-name}.py, where
- result_path: path to the evaluation metric file.
- output_path: path where the t-test result should be stored.
- exp-name: baseline or embed.
2. Run python ./T_test/T_test_{exp-name}.py
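As background on what such a test computes, the sketch below derives the t statistic of a paired t-test from two matched lists of metric values using only the standard library. This is an assumption-laden illustration: the function name is hypothetical, and the scripts in T_test may use a different variant (for example an independent-samples test) or a statistics library.

```python
import math
from statistics import mean, stdev


def paired_t_statistic(scores_a, scores_b):
    """Return (t, degrees_of_freedom) for a paired t-test on matched samples."""
    if len(scores_a) != len(scores_b) or len(scores_a) < 2:
        raise ValueError("need two equally sized samples with at least 2 values")
    diffs = [a - b for a, b in zip(scores_a, scores_b)]
    sd = stdev(diffs)  # sample standard deviation of the differences
    if sd == 0:
        raise ValueError("differences have zero variance; t is undefined")
    n = len(diffs)
    return mean(diffs) / (sd / math.sqrt(n)), n - 1
```

The returned statistic would then be compared against a t distribution with the returned degrees of freedom to obtain a p-value.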

Plot

Plotting is only needed for the ablation study of conservatism. To plot the curves for different alphas under different click models, run:

python plot/plot_alpha_ablation.py [arg1] [arg2]

where arg1 is root_file_path, the path to the root folder of the performance files, and arg2 is output_path, the path of the output file in which to store the plot.

huazhengwang/Unified-Off-Policy-LTR-Neurips2023
