Archive for the 'Supplementals' Category

Googlebot Refreshing Supplementals? • Thursday, May 25th, 2006

Weird 404 error email I sent myself yesterday (url hidden to prevent linking to an adult domain):
HTTP_REFERER: [blank] HTTP_HOST: www.domain.com PHP_SELF: /fgdfgfert4534.html REQUEST_URI /NONEXISTENTURL.html REMOTE_ADDR: 66.249.65.69 TIMESTAMP: 5/24/2006 9:15 PM
Quick explanation: I rigged my dynamic pages, so a request to retrieve “maroon-widget.html” 404s and triggers an email if I don’t have “maroon widget” in my […]


My Next Obsession - Getting a Site Back in the Index • Tuesday, May 23rd, 2006

For the last two months, I’ve been maintaining a holding pattern with one of my adult sites, to no avail. The general concensus was that something was up at Google and I shouldn’t do anything drastic. But now I’m going to start working on my site again. Till I see some progress, how to get […]


Duplicate Content Revisited • Sunday, April 16th, 2006

Tedster wrote a meaty post in WMW concerning duplicate content:
Google tries to select the dupes and then put all but one of them into the “supplemental index”. If a domain has just a few instances of duplication like this in the Google index, things tend to go on as normal. But when many, many urls […]


Supplemental Test Update April 13th, 2006 • Thursday, April 13th, 2006

I’ve been generally keeping up with WMW posts and other SEO blogs and even kept webmaster radio running since noon today, but nothing is really grabbing my interest. I do have a pile of blog post drafts sitting around unfinished, though some of it is so specific to my domains that I’m not sure who […]


Why I don’t Like Index.html • Saturday, April 1st, 2006

The obvious answer is supplementals in Google. I’ve used index.html on about 3 of my domains and since I use Dreamweaver, and sooner or later I make the mistake of linking to a page using /index.html and boom… Google will index it. Even this domain has /index.html for root url. It’s a good thing that’s […]


Robots.txt Before Linking Up • Friday, March 10th, 2006

First thing you should do after you buy a domain is install .htaccess that deals with canonical issues like non-www and /index.html.
Second thing you should do is install a robots.txt that prevents Google from crawling anything except the domain root.
If a hacker decides to submit your urls with a Google url removal tool, this may […]


Wrong SERP Snippet for Cache • Wednesday, March 8th, 2006

I’ve always assumed title/description snippets displayed in the SERP reflects what’s in Google’s Cache. But now, I’m starting to see at least one page where title/description doesn’t match what’s stored in the cache.
Here’s an example:
The cache of one page I’m looking at (I won’t post the url since its adult related) is dated 3/5/2006, […]


Google Cache • Monday, March 6th, 2006

I just checked my site at 216.239.59.147 and noticed a huge drop in pages indexed. Either I’m still doing something wrong or Google is hiccupping again.
I need to check my pages on this DC and see how many of my pages including subdomains are indexed correctly.
Since Google keeps falling back to cache from August […]