CATEGORIES
- WEB STARTUPS
- WEB NEWS
- CONFERENCES
- VENTURE CAPITAL
- MICROSOFT
- WEB TECH JOBS
- YAHOO
- ADVERTISING
- VIDEO
- ALL TOPICS
- ALL COMPANIES
CONTRIBUTORS
- ADRIAN CHAN
- ALICIA NAVARRO
- ALLEN STERN-EDITOR
- CORSIN CAMICHEL
- DARREN HERMAN
- HANK WILLIAMS
- MARK DAVIS
- RICK TUROCZY
- SANFORD DICKERT
- SHANNON CLARK
Hakia
Hakia Launches White Label Semantic Search
NY-based Hakia is announcing the launch of their white-label semantic Web search (they call it Syndication Web Services) today. The idea is simple - you can now add Hakia's services to your site offering more intense search functionality for your visitors.
The business model is welcoming; offer 30,000 searches per day free of charge and free of advertising then they will discuss a relationship with you past the 30,000. What this means is that small social networks may never pay anything for using the Hakia service but provide a great benefit for their users.
The Syndication Web Services include Web search, News search, Vertical search, Summarizer, Categorizer, Characterizer and Text Meaning Representation. The services provide an XML feed, and options to customize the feed. The first company using the new service is Berggi. Berggi has created a worldwide mobile search application.
This month Hakia added PubMed to their search index and also checkout our comparison of Hakia and Powerset.
Hakia Adds 10 Million PubMed Articles to Health Search
Hakia announced yesterday that they have added 10 million health articles from the U.S. government run PubMed search engine. You can now search PubMed from a special PubMed Hakia search or any nornal Hakia search will also return PubMed results. This announcement comes after Hakia announced the launch of the "credible health search".
The Hakia blog post on the PubMed announcement is well worth a read if you are into semantic search. Apparently the PubMed search has holes that may return incorrect or no results. Hakia uses their intelligence to return relevant results.
Check out our comparison of Valley darling Powerset vs. NY-based Hakia.
So How Many Search Engines Aren't Running Google's Ads?
With Yahoo signing away their advertising business today to Google, I thought it would be interesting to look at the major search engines and see who handles their advertising. Have a look at the list below and check out just how dominant Google is. You wonder why no one is using the other search engines -- it's easy, if the search engine can't sell ads, perhaps their technology isn't as good either, right? Please add other engines in the comments and I will add them to the list.
Hakia Adds Social Networking to their Scoopbar
NY-based semantic search provider Hakia is announcing an update to their Scoopbar toolbar. We originally profiled the Scoopbar at their demo in NYC last September. There's a demo of the Scoopbar functionality on the hakia Web site. The Scoopbar allows you to save searches and specific content from the search along with the ability to jump directly to the section of the selected search result with the terms you searched for.
Today's addition brings social networking into play with, "Meet Others." The Meet Others option allows searchers to find other hakia users who have searched for the same terms. From there, each person can contribute to the conversation around the search terms.
Here is how Hakia positions the new Meet Others service, " Meet Others is initiated by entering a query at hakia.com search engine, then clicking on the Meet Others button which opens a room of postings by others who asked the same or similar query. The user can also go directly to meet.hakia.com to search for query rooms. Contact links are available for each posting, but user information is hidden from view. Registration is not required to view rooms and contact people, but posting requires user authentication via email confirmation. The entire operation is optional to participate."
Media Darling Powerset vs. Non-Media Darling Hakia
Over the weekend, The web was abuzz with discussion about Microsoft considering the acquisition of natural language search company Powerset. Some time ago I had heard a rumor that someone was looking at Powerset, but was relatively uninterested. Hearing that the potential acquirer is Microsoft certainly makes it more interesting, but I have to say the concept leaves me more than a bit incredulous.
From skeptic to user
I became familiar with Powerset's only competitor, Hakia initially because they are a New York company. I became intrigued with Hakia because several months ago I tried their search engine, and it worked – really well. This was a surprising result for me since I have always been a skeptic regarding all things relating to artificial intelligence, speech recognition, natural language processing, and other such fuzzy technologies.
At least in the area of natural language processing Hakia that has changed my mind. In fact, it has become common for me to use the Hakia search engine when Google does not deliver sufficient results.
Hakia and Powerset are part of the same general area of natural language search. The idea with both services is that you can actually ask specific questions and get answers. But there are critical differences between Hakia and Powerset. And those differences bring me back to my incredulity at the idea that Microsoft is taking a serious look at Powerset.
Powerset indexes 750 times slower than Hakia!
I have no expertise in natural language processing or semantic search, or any type of full text search for that matter. But as far as I can tell, Hakia’s technology is *far* superior to that of Powerset’s. Why would I say that?
Well first, as I have already said, it works. It is a real live search engine. I use it. I can’t say the same for Powerset. Powerset has yet to show anything but a search engine for Wikipedia. A big part of the reason Powerset doesn’t seem able to offer a real search engine is the fact that according to their own reports, it takes them about 25 seconds to index a page, based on an average of 25 sentences per page. According to Hakia it takes them 1/30th of a second to index a page. Essentially this means that Powerset cannot scale. It is seven hundred fifty times slower than Hakia!
Now you might assume that Powerset is slower because it’s applying some serious, and superior indexing mojo, and therefore what it is doing is much more valuable than what Hakia is doing. But alas that is also not true.
Hakia really knows how to read
Hakia is doing something called “ontological semantics”. What this means is that over the last four years, Hakia has developed an “ontology” for human expression. In layman terms, what this means is that what Hakia does when it indexes a page is to look at each sentence and figure out what the *questions* are that each sentence answers. Any given sentence usually answers 3 or 4 questions. These questions are coded and go into what Hakia calls their Qdex, or question index.
In order to be able to figure out what the relevant questions are for a given sentence, Hakia’s indexer has to literally read the sentence. By “read” I mean it has to understand the actual meaning of the sentence semantically. This is a big deal.
Powerset uses statistics + syntax but can’t actually read
So, while Hakia is actually reading, Powerset, does not actually attempt to understand what sentences mean. It uses a system that parses the syntax of the sentence and guesses matches based on statistics. But this approach means that for questions that do not match previously encountered syntactical patterns, the system will not be able to find answers, even if there are in fact answers in the database.
Powerset benefits from the Silicon Valley echo chamber
Now, if, for a moment, you presume that it is true, or even *possibly* true that Hakia is the superior service and technology, or if you even assume that Hakia is just equivalent to Powerset, why would Powerset be so continuously celebrated while Hakia is overshadowed?
The only answer I can come up with is that the west coast is such an echo chamber that very little sound gets in or out. And so it must be shocking when a New York company develops a technology that seems to beat the pants off something that should be pure Silicon Valley. Just a thought.
In any case, it seems, for the record, worth noting that we have the clear leader in natural language processing and search technology right here. And, as an admitted New York partisan, after a while it does get a little annoying to hear such continued fawning over a west coast company that is very likely, at the end of the day, just another Silicon Valley also-ran.
This article was authored by Hank Williams who is a New York-based entrepreneur who recently launched a new blog: Why Does Everything Suck? exploring the tech marketplace from 10,000 feet.
Hakia Adds Credibility to Health Search
NY-based semantic search engine Hakia has announced the launch of what they are calling the, "Credible online health search." It appears to be a competitor to iMedix. Instead of showing the most popular results first, they are showing the ones that are the most credible. Hakia has published a list of the sources they deem credible. This is a step away from machine powered search and closer to human-based search.
Hakia describes the updates as, "To ensure credibility, hakia is providing results from sources that have met the quality criteria set forth by the Medical Library Association. hakia’s users can now trust what they find, because experts have vetted the sources."
I've said before that I think Hakia does a better job than Mahalo as it brings together human and machine search into a smarter page. Hakia needs to spend more time and effort on marketing now. They are competing with Google on the machine side, Mahalo on the human side, and now iMedix on the health side. All three competitors have serious money and some level of famous executives.
Hakia Picks Up Another $5 Million; Total $21 Million
NY-based semantic search engine Hakia is announcing a new round of funding today in the amount of $5 million bringing the total funding to $21 million to-date. The new investment comes from current shareholder Prokom Investments S.A. Hakia notes that the new funds will be instrumental to hakia.com as it builds a semantic advertising platform and a European datacenter.
We've written about Hakia before including their demo at the NY Tech Meetup which I have embedded below.
I stand firm that hakia is a better solution than Mahalo and could actually be stronger than Wikia Search hopes to be. It's unfortunate that they haven't found the right avenues for promotion and buzz generation so far. I also think they need to figure out marketing messages for the technical people and for the mainstream users.
A Few Mahalo Updates and Questions
This discussion is about four topics related to Mahalo (and the general Internet):
- Is Mahalo a clone of Hakia Gallery?
- Corporate social tool usage
- Where does Mahalo get their high Alexa ranking from?
- Positive employee feedback and encouragement from management
Mahalo: A Hakia Clone?
Last week I met with semantic search engine Hakia. One of the features they demo'ed to me is the "Hakia Gallery". The Gallery is a partially human-edited results for over several thousand results. The immediate reaction I had was "Is Mahalo a Hakia clone?" - do a search for cancer on both Hakia and Mahalo to see an example. Mahalo probably wins a point on design but gives up multiple points as the Hakia Gallery uses some human and some computer to create a much more robust page. So is Mahalo a clone? We already know that Jason knows how to clone... see Netscape.
Update: Based on my offline emails with Jason regarding this section, here is my clarification. I am in no way suggesting that Mahalo lifted any design elements from Hakia. I am merely noting that the pages are similar. I think Mahalo's design is easier on the eyes and Hakia's technology appears better by combining human-edited results plus their search technology. Also, please note that the link to Netscape being a Digg clone goes to TechCrunch.
Conversation with the Hakia Team: The ScoopBar Launch
Today I headed down to the financial district of NYC where the money is made. The area sure has changed since my time down there in the mid-90s. Hakia held an open demo for their ScoopBar which they demo'ed at the NY Tech Meetup this week. The open demo was at a local coffee shop near the Hakia office and there was a lot of foot traffic during my time with the team. This is such a great way to get local people to learn about your application. Hakia supports a local coffee shop, the local coffee shop supports Hakia, the community gets to learn about two companies making a difference in the community and it probably cost Hakia a couple hundred dollars in total. You might consider this type of community outreach when you launch your app. Alex at Read/WriteWeb has a great in-depth review of Hakia.
Here are my notes from the demo:
- The ScoopBar is great for anyone doing research - after installation, searching in the Hakia toolbar (i.e. Scoopbar) allows you to click a link and be taken to the actual Web site with the search highlighted (no need to find it) and then you can save the text (plus a buffer you select) to a local page on your hard drive. I could see this as a great tool for students and educators.
- Downloads are available for IE currently, Firefox version due out very soon
- A Web API is coming soon
- Downloads available through Download.com
Have you tried Hakia or the ScoopBar? If so, please report in!
Hakia Demo Review
Hakia was one of the presenters at the September NY Tech Meetup. They presented their new "scoop bar" which is a way to get content from a Web page and save it to a local file. Frankly I didn't get a full understanding of what this option in Hakia does but here is my attempt at an explanation. It appears that you do a search on Hakia, it provides results with the search query highlighted. Click the query and you jump to the Web page and the term is highlighted as well. Then you can click the scoop button on your browser (it's an add-on) and it asks you how much content from that page you want to save. Click save and it saves the selected text as a file on your hard drive. There were some questions around why it saves to the hard drive and not online. My question would be "what happens if the page changes" - for example, let's say it was a math equation (pi = 3.414x) and it changed, this document would not be updated and now you are left with a problem. If they offered an online repository that saved the content and then kept pinging for updates, that would be a better option. Could be a great researcher tool.
The video is waiting inside for you!

















