How will Google find my web pages?
The process by which Google finds and indexes new web pages is known as ‘crawling‘. The crawling is done by a huge set of Google computers which together are known as the Googlebot. The Googlebot, sometimes also know as just the bot, robot, or sometimes as a spider, uses a program to determine which web sites and web pages should be crawled and how often it should come back and crawl that website again.
One of the principle web page features the Googlebot looks for is links. Links provide the main list of pages to be used on the next crawl. New web sites and web pages are then found via these links on the next crawl. Whilst it is possible t suggest pages to Google, or include the page in a Google Sitemap, very obviously the best way of ensuring pages are found and indexed by Google is to make sure the web site and all web pages have links pointing at them.
Web sites with constantly changing and regular new content are often the the ones which are crawled regularly. Adding new web pages on to a highly crawled site can see them appear in the Google index within days.
In addition to the links, the Googlebot will also look at the words and their position on each page. This will also cover several key parts of the web page, including the title tag, header tags and image alt descriptions to discover the theme of the page.
By simply taking in to account how this process works, it is a relatively easy process to ensure web pages are well indexed by Google. Firstly, ensure all pages have links pointing to them. Without links the page will simply never be found. Secondly, give each web page some unique text content, to ensure the Googlebot has a reason to index that page. This includes a good title tag, header tags reflecting the page and page section content, and lastly that every image has a relevant alt description. Combine this with regular new content and even new web pages on your site, and you will both encourage Google to visit, and ensure re-visits are regular too.
Bookmark this page: