Bluehost Web Hosting Help
Canonical URLs: Telling Search Engines which to Use
For any given website page, there are more than one URL that can be used to get to it. While this is extremely useful in certain situations, like when you want to view your page before the domain propagates, it sometimes can cause undesired effects.
Luckily most search engines, like Google & Bing, understand every page has multiple working URLs, so they usually correctly determine what pages are duplicates, and what URL is most likely the canonical URL. The canonical URL is the preferred URL for a page.
Examples of URLs for a Single Page
For example, depending on your setup, it may be possible to visit the same page by using the following example URLs:
If the domain is an addon domain, it may be possible to view the same page through these hypothetical URLs:
How Search Engines Guess the Canonical (Preferred) URL
First of all, it is important to note that even though all of the URLs exist for the same file, most search engines and your visitors will never encounter them and will not even know they exist. So, for example, that temporary URL your host gave you with your IP address and username in it will only be known by you, unless you tell someone else about it.
Of the ones it knows about, they only know it exists because someone told them about it, usually by using it in a link on a web page somewhere.
Search Engines usually find URLs a couple of ways:
- They found a link to your page on a web page they already knew existed.
- The found the link / URL in a site map and/or RSS feed.
- The link / URL was submitted to them directly, usually via their website.
- Somone visited your page while using their browser toolbar.
Once they find out about the page, they compare it will other pages that appear to be identical or almost exact matches to spot duplicates. If they spot a duplicate page, they then try to figure out which URL should be the canonical URL.
Although they keep their exact algorithms secret, there are some things that they are known to check:
- What URL everybody seems to be using when linking to you.
- What URL is used in your site map and RSS feeds.
- Whether you specified a canonical URL in the meta tags of your pages.
- For Google, whether you specified a canonical URL in Google Webmaster Tools.
- Whether the URL redirects to another URL.
Here are some useful resouces from Google and Bing:
- Google: Canonicalization
- Google: Specifcy Your Canonical
- Bing: Partnering to solve duplicate content issues
Recommended Help Content
Seach Engines do not include my domain
Why doesn't my domain name show up in search engines? (updated 2078 days ago)
Site Not First in Search Engine Results
Why is my site not the first result on Google or an other search engine? (updated 1419 days ago)
Search Engine Placement
How does Search Engine Placement work? (updated 2097 days ago)
Related Help Content
Entropy will not search .php files
Why will Entropy not search any php file? (updated 2079 days ago)
Entropy Search will not finish building the index for my site.
I am trying to rebuild my index for Entropy Search and it stops before 100%. (updated 2099 days ago)
Telling Google How Often To Crawl Your Website
A guide to configuring google's crawl rate for your site. (updated 887 days ago)
Telling Bing How Often To Crawl Your Website
A guide to configuring bing's crawl rate for your site. (updated 892 days ago)
Searching through a MySQL database in phpMyAdmin
Step by step directions on how you can search your database using phpMyAdmin (updated 899 days ago)
Google Flagged My Site as Malware
How to remove the "This site may harm your computer" warning in Google search results. (updated 1244 days ago)
Wordpress Install Without Automatic Database Creation
This article explains how to install WordPress when Mojo is unable to create the database automatically. (updated 81 days ago)
Blocked IP Addresses
This article explains some common causes for IP blocks. For security purposes we do not reveal any specific criteria that will cause your IP address to be blocked. (updated 53 days ago)