3/10/10

How Google Indexes the Web

To start this post, it helps to understand the program in use Google to spider and crawl on a site, the program commonly known as BOT, he is the program to crawl a site or page on your site regularly or frequently, depending on the popularity of a site .

Google bot to be divided in 3:

1. Adsense Bot
2. FreshBot
3. DeepCrawl Bot

Adsense bot, the name we've got this idea that the robot has to do with Adsense, he was assigned to study and ultimately identify the pages of Adsense publishers site with a message from the Javascript code is integrated into our Adsense code in the template Lay, the goal for your ad Adsense that appears relevant to the niche blogs and content of the publisher.

Freshbot is the most active bot on the other bots, she regularly visited a blog for crawling pages and pages of the most popular. Any number of these blog pages. However, the frequency of visits Freshbot magnitude can not be separated from the popularity factor of a blog / website, of course the more popular a web, the more often freshbot bertandang. News sites like BBC, CNN or sites like Ebay.com Online Marketplace, Amazon.com in the crawl in minutes (usually 10 minutes or so) each day.

Every blog that has been a while online, generally have what is called the Deeper Links, well when they Freshbot discovered during a visit, he will place a link deeper in queue list (queue) wait until Deepcrawl action.

This bot bot DeepCrawl arranged for each crawl once a month, before he acted, he first checked all the links that have been deeper ngantri in the database, then use it as refrensi when crawling a blog. in because of, he was in the program to visit each once a month, it will take a month to keep all of our blog content indexed. Pagerank update and a blog / page is closely related to the arrival Deepcrawlbot.

No comments:

Post a Comment