Saturday, September 24, 2011

Converting All Input to Text

I thought on this and decided maybe the whole system needs to be able to pick out memes by looking at everything it sees as text so that the semantic content can be extracted with a tool like ResearchCyc.

So how do we convert everything to text?
  1. Text is already text. Lucky!
  2. Audio can be converted with open-source speech recognition tools. Example: CMUSphnyx
  3. Convert videos to text by sampling each frame through open-source OCR, and grabbing all text visible at each frame and noting it in some kind of markup so we might be able to piece together the subtitles in a paragraph and the signage in each frame or "movie set" as a separate (and potentially useful) idea/meme.  (Does this mean we need to think of "place" as a factor in memes?)
 This is how we can absorb everything the internet can dish out.  Of course, this is all just a thought-experiment until we piece it all together, and the state of every open source package is subject to great capability variation (i.e. we might not be able to use the example packages or any other tools found on Sourceforge etc.

No comments:

Post a Comment