Search engine are not doing search anymore… No search is for small player, now the real thing is to have an evolved engine. Where Wolframalpha offer a computational knowledge engine, Bing offer a decision engine. So far Google can keep being Google fearless ; )

Bing (also known as Kumo) should be publicly available on June 3rd. At least the name is easy to remember :)

You can already get some information on it:
- Behind bing
- Decision engine

I personally think it is a good idea to not compete with Google directly on traditional search, they real do a great job there. Live.com showed to be quite good but did not had any motivational factor to change people habits. Bing, might succeed there, I guess most of the people will be positively surprised by something else than search : )

Enjoy soon,
Ahmet

 

Since I started my new job I have been so much involved in discovering new technologies that I quite forgot to post any significant and regular update to this website. Google’s ranking algorithm did not pardon this move: my page rank fall from 7 to 4 !!!

With a page rank of 7 I had the feeling to be over ranked but with a page rank of 4 I really have the feeling to be under ranked :)

To avoid continuing this move down, I have decided to:
1. Create a Twitter account and publish my tweets here to make the page change more often.
2. Take more time to write on this blog or decide to let it down :o
3. Refine the focus of this blog, obviously it was oriented web technologies now it is heading to software development and testing.

If you want to help me and share some of your insight, I am looking forward to read your advice below!

Ahmet

 

Microsoft in its fight for search market share has finally acquire Powerset (a natural language processing system). The system is really impressive and could be a real value for users. For example if you type in ‘How tall is the Eiffel Tower’ you will have the answer: 300.65 Meters

According to an interview in TechCrunch the Microsoft search team will implement Powerset’s functionalities in their search engine (Live.com) by the end of this year.

Below is a simple explanation of how it works (by Barney Pell):

[...]I guess one way to think about it is like when you are learning how to diagram sentences in elementary school. You draw these trees of a sentence and find here is the noun phrase and a noun phrase has a determiner like “the” and then it has a noun like “dog” and here is a verb phrase, and it might have a verb like “barks” and then what does it mean for the that word, bark is a verb and it has a “S” at the end and the way that it works, which we call morphology, that’s the present tense of that verb. And then the whole sentence is composed of those pieces, and so the meaning is built out of those. So you draw these diagrams when you are learning how to do it. And the kind of knowledge that’s in a natural language processing system like Powerset is using is sort of like that. Its basically extracting out both the surface structure, that kind of a tree structure of a sentence, and then its converting that into a series of different representation, ultimately into one which expressing that thing in fact. So it will basically say that there is a kind of activity here and it is a barking activity and the thing that is doing that activity, the subject of that activity, is a dog. Ok. So it is going from that sort of a surface structure of the language that you are seeing and converting it into a semantic factor representation. In addition, it is then able to draw on the individual meaning and relationships between words so if you saw that the sentence said “The poodle barks.” Then the system knows, if it can draw upon other knowledge about the relationship between words, as Powerset does, that poodles are a kind of dog. So if you as the user were able to say, “I want dogs barking” then it can actually then match the concept of dog to the concept of poodle and it is matching barking to barking and it is then doing this sort of semantic match for you which uses words you are not even using in your query and matching those against the document.[...]

Then Ramez Naam gave some more explanation:

[...]So one of the things that Powerset brings that is unique is the ability to apply their search technology to the query to the user’s search in ways that are beyond just the simple pluralization or adding an “-ing” is that Powerset also looks at the document, it looks at the words that are on a web page and this is actually very important. If you look at just the users query, what you have available to you to figure what they are talking about are three words four words five words, maybe even less. That can give you certain hints. If you look at a web page that has hundreds or thousands of words on it you have a lot more information you can use if you understand it linguistically to tell what its about, what kind of queries it should match and what kind of queries it shouldn’t match[...]

A quick note about given answers by Powerset:

We return answers. We actually synthesize, so if you were to say, “What did Tom Cruise star in,” you actually get not just the movies, but the cover art for the different movies. It synthesizes multiple pieces of information to give you a whole different kind of presentation. Or, if you were just to say, “Bill Gates” you’d be given an automatically generated profile of Bill Gates, pulled across many, many articles. It’s no longer just about 10 links, although we can certainly do more relevant job (and will) of the blue links, and a better job of presenting those links. With the language understanding systems which we now have, we can go way beyond that and open up a whole new door in user experience until you think, “oh god, that’s how I used to search, now I want this whole new different kind of thing.”

It is also interesting to note that Microsoft will use natural language processing powered by Powerset in Word for grammar and not only in the Live search engine.

Now Google doesn’t have to worry yet but I bet that they are also working hard on natural language processing…

A.

 

I’ve just read via the AXNA that Adobe, Google and Yahoo! worked together to allow SWF searchability. My first thought was ‘finally they made it’ :)

But while reading the FAQ I’ve been a little disappointed by how this indexation seems to happen:

[...]The Flash Player technology, optimized for search spiders, runs a SWF file similarly to how the file would run in Adobe Flash Player in the browser, yet it returns all of the text and links that occur at any state of the application back to the search spider, which then appears in search results to the end user[...]

FlashFlex

[...]All of the extracted information is indexed for relevance according to Google and Yahoo!’s algorithms. The end result is SWF content adding to the searchable information of the web page that hosts the SWF content, thus giving users more information from the web to search through[...]

So I’m afraid that it means that there will be absolutely ‘semantic’ information on the indexed tag. Is this text a title, a paragraph or an image caption? Does this mean that the SEO effort we have to do on SWF are only based on text and keywords?

If you have text with an Alpha of 0 but that have a tween, will it be indexed?

I’m very happy that A_G_Y are working to offer a better indexation and searchability of SWF but as far as I understand it, they are completely missing the SEO point here.

What do you think?

Ahmet

 

Must have been the first time I saw one of Google services being down…

Adsense is down

http://www.google.com/errors/asfe/system_down.html

Now, who is going to pay for the adds not distributed :) ?

Ahmet

 

Matt Cutts, the head of Google’s Webspam team has published on his blog his session at Web 2.0 Expo:

You can access all the video from the expo on Blip.tv

It is interesting to note that Google have no problem with SEO and that the politic again (web) spammer is to make them lose time, effort and frustrated them :)

You can also view the slide from his presentation directly on his post.

Ahmet