Speaker: Professor Alexander G. Hauptmann, School of Computer Science, Carnegie Mellon University
Since we have long been convinced that semantic analysis will be critical in the long term, we have been studying ways to train large numbers of semantic classifiers. We have been obtaining very encouraging results in obtaining large amounts of automatically harvested weakly-labeled training data from the web, as well as building an infrastructure that allows us to efficiently process the video data into semantic concept classifiers for objects, scenes, and actions. In turn, this has uncovered new challenges in how to use thousands of semantic detectors during browsing or querying.
This research was critical in the top performance of our team at this year’s TRECVID video analysis competition, as well as setting the stage for significant advances in coming years.
Alex Hauptmann is a Principal Systems Scientist in the Carnegie Mellon University Computer Science Department and a faculty member with CMU’s Language Technologies Institute. His research interests have led him to pursue and combine several different areas: man-machine communication, natural language processing, speech understanding and synthesis, machine learning. He worked on speech and machine translation at CMU from 1984-94, when he joined the Informedia project where he developed the News-on-Demand application. Since then he has conducted research on video analysis and retrieval on broadcast news as well as observational video with success documented by outstanding performance in the annual NIST TRECVID video retrieval evaluations. His current research centers on robust analysis of internet-style and surveillance video as large scale.
For further information, please visit: