I'm CJ, a PhD student in HKUST, currently engaged in a project involving a substantial volume of multiple videos featuring a speaker addressing an audience, over 1000 hours in total.
I'm seeking advice on primers, methods, or tools that could effectively and efficiently handle the coding and analysis of such large volumes of video data. Given the scale of the data, manual coding seems implausible.
Each video also contains distinct textual elements that are separate from the subtitles. Therefore, I'm seeking suggestions on tools or methods that can effectively analyze the speech, as well as extract and analyze these embedded textual elements from the videos, in conjunction with the speech data.
If you could kindly share your insights or experiences, please send an email to firstname.lastname@example.org.
Looking forward to learning from the community!
CJ RheePhD Candidate, Department of Management, Hong Hong Univ. of Science & Tech.