tag:blogger.com,1999:blog-8833212223215831811.post40561049103898905..comments2023-04-30T01:04:58.091-07:00Comments on DTC356 Class - Tom: Blog Post #2 - Weinberger Prologue, Ch. 1Tomhttp://www.blogger.com/profile/18364902029751864219noreply@blogger.comBlogger4125tag:blogger.com,1999:blog-8833212223215831811.post-41424187160640886292011-01-30T13:10:36.462-08:002011-01-30T13:10:36.462-08:00Semantic search technology really is only as good ...Semantic search technology really is only as good as the data going into it - if you are informing or training the system with a small corpus, it'll perform well with that corpus but not much else. What any engine of that sort needs to know is how the words that appear most frequently relate to each other. It can "learn" that in two ways - by analyzing the frequency of co-occurrence of words (by sentence grouping, etc) and by watching how often its search results are used by a user. If it makes an informed guess (by throwing together a list of associations ordered by weight descending), and the user encourages it by clicking the link, it strengthens the associations that it used to make the guess.<br /><br />It's a long way from that sort of engine to the actual semi-intelligent stuff that Google's working with/on, though they're (at least in principle) the same. The difference is really the astronomically huge database that Google has to work with and the fact that they have hundreds of actual geniuses working on the problem. =)Tomhttps://www.blogger.com/profile/18364902029751864219noreply@blogger.comtag:blogger.com,1999:blog-8833212223215831811.post-61580985443140495642011-01-30T12:42:34.845-08:002011-01-30T12:42:34.845-08:00Great post Tom. If others were smart, they'd &...Great post Tom. If others were smart, they'd "borrow" your notes cliff-notes style. Also, given you've actually written a semantically based search engine before, you're coming at this from an entirely different perspective than most of us. Feel free to pipe up in class at any point if it seems I'm full of it. I'm all theory on this one, and zero practice.kristinhttps://www.blogger.com/profile/16589066910958948930noreply@blogger.comtag:blogger.com,1999:blog-8833212223215831811.post-43947935981804902112011-01-25T21:26:55.446-08:002011-01-25T21:26:55.446-08:00I agree with your point about weight of the data w...I agree with your point about weight of the data we produce in modern life exceeding our capacity to organize it. I have a folder with pictures from two deployments and a lot of them are not label so when I want to see pictures when I was in Baghdad I can’t find them because I failed to label the pictures correctly. So I have to open the folder which holds the pictures from Iraq and I almost have to browse to them one by one to find what I’m looking for. Sometime I don’t feel like have thrown them away since I can’t find what I’m looking for. <br />I have to agree with you that Weinberger loves the concept of miscellaneous and organization. I agree with his idea of how we strive to have everything organize and work very hard at it. We live in a fast pace world that if we are not organized time slips away.AlexanderManueleshttps://www.blogger.com/profile/03649930214624890419noreply@blogger.comtag:blogger.com,1999:blog-8833212223215831811.post-46718019146082005872011-01-25T17:29:33.570-08:002011-01-25T17:29:33.570-08:00It is very interesting how you tied in the key poi...It is very interesting how you tied in the key points of the book, with your own personal life. Your experience building a search engine and working on web servers must give you a different, more technical perspective on Weinberger's book. All those projects sound like a lot of work. I have a simular project, building a php based dynamic website, and I understand how it is hard to finish them, but I like your goal to complete at least a couple of them, in order to minimize disorder.Lane Quadraccihttps://www.blogger.com/profile/08038984172922083484noreply@blogger.com