Hi all,
I thought I would stir things up a little- no doubt this has been argued many times before but I really didn't have time to go through over 1300 threads.
I SAY COPIED CONTENT IS NOT AN ISSUE WITH GOOGLE ........ thousands will disagree so let me elaborate.
Firstly I am a believer in explaining ones comments or viewpoints as best as possible so if anyone is going to reply to this post with a simple WTF, or are you kidding or similar without then offering thier reasons then please do not bother. Oh, and before we go too much further, I DO NOT PROFESS TO BE AN EXPERT so what you read here is my opinion only, nothing more, and everyone is welcome to agree to disagree
OK back to that rash statement, here is my logic / reasoning. Google does not have the capability to crawl every single page on the web let alone record and store everything its crawlers "have seen" on the web over a long period of time - it does store and record certain information for a certain period of time though we do not really know for how long, is it a month, a week, a year??. The crawlers use algorithms - these algorithms are essentially mathematically based programs put into a computer - the robot cannot read per se, it sees but doesn't read. Its like hearing but not listening, we've all done that haven't we? You hear something someone said to you but when asked to recall it you can't because you were not tuned in to actually listening - listening to the radio whilst you drive etc, etc.
So how would a crawler be able to tell if something is a copy of something else if it does not have it in its library. It will be able to determine it is copied IF it has a reason to in the first place and it has a trail to follow ......... yes, a link. If there is a link in the content then the crawlers will follow that link and bingo!! Secondly why would they look for a link? If the title of the content is the same as where it originally came from, and very much so if that title is in the "heading" of the page, then the crawler will possibly (not always) have seen that information recently on a crawl so that information would be in its current memory and trigger an alert.
Many things that Google say that its crawlers can do should be taken with a grain of salt - much is fact, some is moondust aimed at curbing practices they cannot control and minimizing content they cannot crawl. If they said copied content was ok how much more content would there be in the WWW - it would double or treble or more, as well as your own content you would use someone elses so as to boost your number of pages (I would). Copied content goes against the ethos of Googles position as professed to its users, i.e. to produce search results of quality that best match the searchers requirement ... bla bla bla. So of course they are going to say that they can find copied content.
And yes they can find it, but only ocassionally and only under certain circumstances. Importantly they DO NOT PENALIZE for copied content, they simply do not give you the brownie points you would get if it was original (like links, there is no such thing as a bad link, some are just better than others).
Google does not class free press releases, free articles etc etc as copied - why not? - is there a special code in the source html that tells the crawlers "hey its cool man we are allowing everyone to use this content" .... me thinks not Watson dear chap. The "free" stuff doesn't have any links in it - and not just in the text but anywhere including images, backgrounds etc.
How do I know this to be the case? - I DON'T - but in reading ship loads of posts on different sites it appears NO-ONE REALLY KNOWS - even Matt Cutts of Google contradicts himself on the issue (apparently). Applying my own algorithms sprinkled with some basic logic and a dash of street samrts topped up with "copied" ideas from others I think the above sounds kinda plausible.
So now lets have some comments. If I am wrong then I need to know (so I can delet a lot of stuff) - but you have to convince me not just poo poo me.
cheers,
Mike
Quality of the content - this is a related but totally different subject on which I also have some views.



- no doubt this has been argued many times before but I really didn't have time to go through over 1300 threads.
