The promise of high search engine rankings, and the ensuing traffic, is making some very large academic publishers use a black-hat spamming technique called 'cloaking' to attract visitors to their sites via the search engines. The idea of cloaking is simple: when the search engine indexing crawler requests a page, the website gives the crawler one version of the page. When a human visitor requests the same page, they see a different version. Distinguishing search engine crawlers from human traffic is very easy, and in the case of Google and MSN/Live, it's 100% fool-proof.
The publishers in question all behave in a similar pattern: the search engine results are for a PDF file, presumably a paper that Google thinks is relevant. When a user clicks the link to the PDF, they are instead presented with information about how to purchase the article. So in this case, the cloaked version of the page is the PDF and the real version is the purchase form.
I've been getting more and more annoyed by these publishers, and two days ago John Baez from the Department of Mathematics, University of California, Riverside, complained about this spam. The comments show just how annoyed people are about this. John's post prompted me to join the naming and shaming of these spammers, and so without further delay...
SpringerLink
They seem to be the best at gaming Google. The papers hosted on springerlink.com rank highly in many fields of knowledge. The simplest way to out them is the search for [site:springerlink.com filetype:pdf], which queries Google for all PDF files hosted on springerlink.com. A screenshot of the results I'm getting is below. Just click any of these purportedly PDF files and see what happens. For the first result in the screen shot, I'm getting this page.

IngentaConnect
Next up is ingentaconnect.com. They don't have much of a search engine presence, but they still cloak. An example is the search in the screenshot below using the search term [site:www.ingentaconnect.com intitle:"journal"]. The third result I get is a PDF that when I click on leads to this page.

Royal Society of Chemistry
Yep, rsc.org. The screenshot below shows how to find cloaked PDFs by searching for [site:rsc.org filetype:pdf "carbon dioxide"]. The very first result leads to page asking for £22.

Taylor & Francis
T&F host a lot (all?) of their journals on informaworld.com. So what does Google say for [site:www.informaworld.com filetype:pdf "carbon dioxide"]? The third result in the screenshot below takes me to a page asking for £18 this time.

Conclusion
Well, the publishers are clearly cloaking their pages for the Google crawler. This contravenes Google's webmaster guidelines, and lesser websites have been removed from the Google index for similar tactics. What's amusing is that Google keeps making a big fuss about how cloaking is bad but doesn't do anything about these big publishers.
There is a question here we have to discuss: is this really cloaking if paying customers eventually end up seeing the same content that the search engine crawler sees? Some think in these instances it is not cloaking, but I beg to differ: What the average user sees should be identical to what the search engine crawlers see. If a page is in the search engine index it means that anyone can have access to it, without the need to register (even if free) or paying. That's my 2c. Take it or leave it.