ALL >> Search-Engine-Optimization >> View Article
Search Evolution And Seo At Google-00-1732
Now, you and I may think that this "letter T" hack is really a silly idea, but how can we know for sure? Search evaluation is charged with answering such questions. This hack hasn't really come up, but we are constantly evaluating everything, which can include:
* proposed improvements to segmentation of Chinese queries
* new approaches to fight spam
* techniques for improving how we handle compound Swedish word
* changes to how we handle links and anchortext
* and everything in between
Not surprisingly, we take search evaluation very seriously. Precise evaluation enables our teams to know "which way is up". One of our tenets in search quality is to be very data-driven in our decision-making. We try hard not to rely on anecdotal examples, which are often misleading in search (where decisions can affect hundreds of millions of queries a day). Meticulous, statistically-meaningful evaluation gives us the data we need to make real search improvements.
Evaluating search is difficult for several reasons.
* First, understanding what a user really wants when ...
... they type a query — the query's "intent" — can be very difficult. For highly navigational queries like [ebay] or [orbitz], we can guess that most users want to navigate to the respective sites. But how about [olympics]? Does the user want news, medal counts from the recent Beijing games, the IOC's homepage, historical information about the games, ? This same exact question, of course, is faced by our ranking and search UI teams. Evaluation is the other side of that coin.
* Second, comparing the quality of search engines (whether Google versus our competitors, Google versus Google a month ago, or Google versus Google plus the "letter T" hack) is never black and white. It's essentially impossible to make a change that is 100% positive in all situations; with any algorithmic change you make to search, many searches will get better and some will get worse.
* Third, there are several dimensions to "good" results. Traditional search evaluation has focused on the relevance of the results, and of course that is our highest priority as well. But today's search-engine users expect more than just relevance. Are the results fresh and timely? Are they from authoritative sources? Are they comprehensive? Are they free of spam? Are their titles and snippets descriptive enough? Do they include additional UI elements a user might find helpful for the query (maps, images, query suggestions, etc.)? Our evaluations attempt to cover each of these dimensions where appropriate.
* Fourth, evaluating Google search quality requires covering an enormous breadth. We cover over a hundred locales (country/language pairs) with in-depth evaluation. Beyond locales, we support search quality teams working on many different kinds of queries and features. For example, we explicitly measure the quality of Google's spelling suggestions, universal search results, image and video searches, related query suggestions, stock oneboxes, and many, many more.
To get at these issues, we employ a variety of evaluation methods and data sources:
* Human evaluators. Google makes use of evaluators in many countries and languages. These evaluators are carefully trained and are asked to evaluate the quality of search results in several different ways. We sometimes show evaluators whole result sets by themselves or "side by side" with alternatives; in other cases, we show evaluators a single result at a time for a query and ask them to rate its quality along various dimensions.
* Live traffic experiments. We also make use of experiments, in which small fractions of queries are shown results from alternative search approaches. Ben Gomes talked about how we make use of these experiments for testing search UI elements in his previous post. With these experiments, we are able to see real users' reactions (clicks, etc.) to alternative results.
Clearly, we can never measure anything close to all the queries Google will get in the future. Every day, in fact, Google gets many millions of queries that we have never seen before, and will never see again. Therefore, we measure statistically, over representative samples of the query-stream. The "letter T" hack probably does improve a few queries, but over a representative sample of queries it affects, I'm confident it would be a big loser.
One of the key skills of our evaluation team is experimental design. For each proposed search improvement, we generate an experiment plan that will allow us to measure the key aspects of the change. Often, we use a combination of human and live traffic evaluation. For instance, consider a proposed improvement to Google's "related searches" feature to increase its coverage across several locales. Our experiment plan might include live traffic evaluation in which we show the updated related search suggestions to users and measure click-through rates in each locale and break these down by position of each related search suggestion. We might also include human evaluation, in which for a representative sample of queries in each locale, we ask evaluators to rate the appropriateness, usefulness, and relevance of each individual related search suggestion. Including both types of evaluation allows us to understand the overall behavioral impact on users (via the live traffic experiment), and measure the detailed quality of the suggestions in each locale along multiple dimensions (via the human evaluation experiment).
Choosing an appropriate sample of queries to evaluate can be subtle. When evaluating a proposed search improvement, we consider not only whether a given query's results are changed by the proposal, but also how much impact the change is likely to have on users. For instance, a query whose first three results are changed is likely much higher impact than one for which results 9 and 10 are swapped. In Amit Singhal's previous post on ranking, he discussed synonyms. Recently, we evaluated a proposed update to make synonyms more aggressive in some cases. On a flat (non-impact-weighted) sample of affected queries, the change appeared to be quite positive. However, using an evaluation of an impact-weighted sample, we found that the change went much too far. For example, in Chinese, it synonymized "small"and "big" not a good idea!
We're serious about search evaluation because we are serious about giving you the highest quality search experience possible. Rather than guess at what will be useful, we use a careful data-driven approach to make sure our "great ideas" really are great for you. In this environment, the "letter T" hack never had a chance.
It is a fact that, algorithms play a very vital role in influencing the search results. But we are also aware of the fact that Google has human evaluators — thousands, in fact
-Scott Huffman, Engineering Director.
About the Author:
Learn about http://www.hirank.com/ at the www.hirank.com/blog/
Click here to read more on http://contentlib.co.za.
Add Comment
Search Engine Optimization Articles
1. Top Seo Services In KondapurAuthor: vijji
2. Technical Seo Service In Dubai – Building A Strong Digital Framework For Growth
Author: gcc
3. Best Free Ai Tools To Generate Leads In 2025: The Ultimate Guide For Businesses
Author: GO DIGITAL ALPHA
4. India’s Aviation Growth Driven By Rising Passengers - Skyone Blog
Author: Skyone
5. Seo, Aio, Geo, Aeo & Sxo Explained
Author: Dealsinsight
6. The Future Of Farming: Agriculture Equipment Market Revolution
Author: Pujitha
7. Drive Digital Success With Top Seo Experts In New Zealand
Author: Top Rank Digital
8. Top Digital Marketing Services For Wedding Management Firms In 2025: How To Get Maximum Leads
Author: GO DIGITAL ALPHA
9. Smm Panel For Youtube Watch Time: Accelerate Your Channel’s Growth And Monetization
Author: yoyo media
10. Indian Smm Panel For Instagram: Affordable Growth Solutions In 2025
Author: my smm panel
11. Best Instagram Smm Panel
Author: my smm panel
12. From Local To Global: Affordable Seo Company In Mumbai Forefront
Author: HD99SOLUTIONS
13. The Trend Spotter’s Guide To Seo: What’s Working Right Now
Author: Brainwavesindia
14. 500 Free Guest Posting Sites List (da 30-90) 2025
Author: hightechsaini
15. Reasons To Rely On An Ecommerce Website Design Company
Author: Liam Mackie






