Discussion about baselines for CodRep #13

mallamanis · 2018-04-12T18:01:55Z

Hi,
Thanks a lot for organizing this :) Hope that you don't mind the drive-by issue submission: I would like to suggest three additional, strong, but reasonable, baselines:

Random prediction over the lines where after the replacement the code still parses;
The line that is the most similar to the line being added (e.g. max % common tokens between the lines);
The combination of the above.

The reason I am suggesting this, is that these baselines seem easy "hacks" to achieve reasonable performance without any machine learning.

egor-bogomolov · 2018-04-12T18:08:05Z

Sounds reasonable. For now I've implemented second one and halfway in implementing first/third (halfway means that it's implemented but works slowly and not debugged properly).
Second idea on it's own gives around 85% accuracy.

egor-bogomolov · 2018-04-12T18:09:14Z

Small correction, I matched prefix and suffix instead of tokens.

chenzimin · 2018-04-12T19:26:34Z

Hi all,

Thank you for sharing ideas to improve the competition.

These baseline algorithms that I included in this competition are really "stupid". I did not include other baselines since I do not know how these would perform compared with your algorithms. The baseline algorithms should be easily beaten and easily understood.

If, however, that everyone seems to be able to beat the proposed baseline algorithms. And the idea of "The line that is the most similar to the line being added" or "the code still parses", is commonly adopted. Then I would add the proposed baseline algorithms to the set of baseline algorithms in the competition.

monperrus · 2018-04-13T14:27:31Z

Very nice to see you here @mallamanis!

Hope that you don't mind the drive-by issue submission

Oh no, that's great to have a lively issue tracker for discussing all sorts of things!

mallamanis · 2018-04-15T10:38:19Z

Hi all,
Thanks for your prompt responses :)

My rationale for suggesting those baselines was that filtering the lines where parsing fails would be a constraint that I would bake-in to any model to filter obvious false positives. The "most similar line" baseline would be the first "sanity check" to check if an ML model captures something beyond line similarity (such as context around each line) which seems a somewhat strong predictor.

monperrus · 2018-04-15T14:10:52Z

The "most similar line" baseline would be the first "sanity check" to check if an ML model captures something beyond line similarity (such as context around each line) which seems a somewhat strong predictor.

100% agree.

The corollary is that this dataset and loss function helps to see what ML techniques are good at capturing line similarity.

mallamanis · 2018-04-16T22:01:30Z

Agreed, although I am sure that capturing some information about the context will also useful.

monperrus · 2018-06-27T08:48:19Z

No more activity here, closing the issue. Don't hesitate to reopen if appropriate!

monperrus changed the title ~~Baselines~~ Discussion about baselines Apr 13, 2018

monperrus changed the title ~~Discussion about baselines~~ Discussion about baselines for CodRep Apr 13, 2018

monperrus mentioned this issue Apr 15, 2018

Participant %2: Allamanis et al., Microsoft Research #14

Open

monperrus closed this as completed Jun 27, 2018

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Discussion about baselines for CodRep #13

Discussion about baselines for CodRep #13

mallamanis commented Apr 12, 2018

egor-bogomolov commented Apr 12, 2018

egor-bogomolov commented Apr 12, 2018

chenzimin commented Apr 12, 2018 •

edited

Loading

monperrus commented Apr 13, 2018

mallamanis commented Apr 15, 2018

monperrus commented Apr 15, 2018

mallamanis commented Apr 16, 2018

monperrus commented Jun 27, 2018

Discussion about baselines for CodRep #13

Discussion about baselines for CodRep #13

Comments

mallamanis commented Apr 12, 2018

egor-bogomolov commented Apr 12, 2018

egor-bogomolov commented Apr 12, 2018

chenzimin commented Apr 12, 2018 • edited Loading

monperrus commented Apr 13, 2018

mallamanis commented Apr 15, 2018

monperrus commented Apr 15, 2018

mallamanis commented Apr 16, 2018

monperrus commented Jun 27, 2018

chenzimin commented Apr 12, 2018 •

edited

Loading