Dataset generated by the methods in "What's Cookin'? Interpreting Cooking Videos using Text, Speech and Vision"
Dataset (zip ~7.5 MB): Dataset (zip)
This dataset contains comma delimited data containing: YouTube video id, start time in milliseconds, end time in milliseconds, classified action, classified object (may be empty).
This data is released by Google under the following license:
This work is licensed under a Creative Commons Attribution 4.0 International License.