Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

Is there a version that doesn't require watching a video please? This would be 10x faster and easier as a text blob


yes, you must save it to a laserdisc, and then observe it though a highly magnified digital microscope looking for a specific frame.


https://wiki.techtangents.net/wiki/Seeing_Media

writeup from the author linked in the video description


this should be the main link


No!


Most people would get a "Making sure you're not a bot" anime girl with that link.


It flashed too briefly for me to understand what I was seeing.


Thank you! So much more helpful.


It took me like 2 minutes to find the relevant part:

https://youtu.be/qZuR-772cks?si=rYM4EjvV7VeTEzx8&t=1570


You can ask your human to watch the video and write the text blob for you.


Just post it to social media to generate the text description for free (multiple humans in parallel work better)


Short answer: Skip to 22:45

Long answer: Just jump around. Once you get to the last 1/3rd of the video, there's a lot of close ups of the part of the laserdisc that recorded the credit sequence. The on-screen (laserdisc) text is clearly visible.

The author moves a flashlight around to show how the angle is important; something that won't come across as well in a blog entry.


[flagged]


It's likely based on just the transcript, even if it describes visual things, it likely guesses those things from the transcript text only.

Maybe it's better now, but that was how it did it recently. To be convinced that it "watches" the video, I would need to see evidence of it referring to facts that are strictly only possible to know from the video, but not guessable from the audio.


You can try it with your own recorded video. I record myself doing exercises and Gemini gives me really good feedback on my form.


use gemini and ask it to summarize a youtube link




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: