As a computer engineer love to hear your idea. NDA, or whatever. One of the reasons I started transcribing, was to make it easier to search the audio. Keep at it.
Sorry for such a delayed reply. Holy crap got through doing alot in the last few days. Had a wedding, moved all my stuff from Virginia to Louisiana, so I've been pretty much on the road for 2 days hah.
I have some contacts through my company that I was going to ping about it because they do audio content indexing for sure. I'm not sure what all they're using, and I'm sure some of it isn't going to be free. However they do have alot of contacts through universities and I think alot of the technology that we use at my job comes from universities. So who knows.
There is no questions asked once I finally get back to working on my application I'll be contacting you and anybody else to see if I can't build in the transcriptions into the "Survival Podcast Database" that I have been putting together.
I have already selected my text searcher and boy can I not wait to unleash it. I plan on using the Whoosh Python library. Right now it'll just be able to do powerful searches across the text, description and perhaps even the URLs. Also for us power users I'll have the ability to do regular expression searching as well. I actually sent an email to jack asking if he could start including just some basic keywords with the podcasts that really hit the main points of the audio content in an easy and trustable parsing way. I'm thinking about calling up as an extra incentive.... I probably wasn't clear enough with my intentions to improve searching capabilities. As of right now the current searching technique is pretty awful.
I'd like to be able to go "steve harris" and REALLY get steve harris hits, ordered by match hits, perhaps even weighted depending on location (title vs html content).
Nearly all of my application is complete the only thing really left at this point is doing GUI stuff. I need to look around the forums for a good place to announce my project.... I'll probably wait till i actually sit down for a weekend and crank out more code that way I feel like i'm not leaving people hanging.