HomeButton
Introduction
Curriclum
Programs
Project
Resources/Ideas
P16-Portal
EdSeek
Help
 

 
How Web Spiders Work
 

     

How do the public search engines find the pages on your website and index them? The public search engines use automatic scripts to find webpages, read the text, parse it and store it in a database. Finding the base site for your webpages is easy because they are registered names such as www.uiuc.edu, www.shiloh.k12.il.us. However once the web-bots or "spiders" find your base page at the top of your site they they are programmed to follow links. They follow links, they don't follow java script navigators or image maps, they follow links. If your site contains internal pages with no link from the top base page then those interior pages are "invisible" to the spiders and are never indexed. Thus the are never shared.

 
Media Player needed to view tutorials - http://microsoft.com/windows/windowsmedia/
 
 
 
The secret to making your content "visible" to the public search engine spiders is to connect a sitemap.html document to the base page in your website. A sitemap is the blueprint and the master key giving the spiders access to all the content on your site. So, can you also specify content which they do not access? Yes, look at the information on robots.txt and robot metatags.
 

 

Introduction
Curriculum
Courses
Projects and
Resources
P16-Portal
EdSeek
Help
ePortfolio for Terence Sullivan
Last updated 8/01/2002
tsulliva@comwares.net