How are the inputs look like #12

nemo0526 · 2023-03-28T12:19:42Z

The HowTo100M require a huge storage that I can't make a training. May I ask that how the videos and the texts look like before put into transformer module(like[31373, 11, 1312, 716, 36945...]or[0.0524,-0.0960,-0.1728...])? Do you directly contenate the tensor after feature extraction and put into transformer module without any adjustment?

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

How are the inputs look like #12

How are the inputs look like #12

nemo0526 commented Mar 28, 2023 •

edited

Loading

How are the inputs look like #12

How are the inputs look like #12

Comments

nemo0526 commented Mar 28, 2023 • edited Loading

nemo0526 commented Mar 28, 2023 •

edited

Loading