Colnect, Connecting Collectors: Google

The world's most popular search engine has now indexed over 245 Million Colnect pages! That is a phenomenal amount. That is almost 10x times more pages presented than in Colnect's previous milestone. Colnect is ever rising and the results are definitely showing worldwide :)

What does it mean? It means that our catalogs are growing and so there's much more information that is made available to everyone around the world. It means Colnect is flourishing and that Google accepts its original content as something valid and useful.

Globe "Now Your World is One" United States Phonecard, newly added on Colnect

Chinese Puzzle Phonecard, newly added on Colnect

To give you readers a better idea of how far the Colnect catalogs have come since then, here are a few more numbers. In comparison, Colnect's Tea Bag catalog, then only in its beginning, has added more than 50,000 items, making it 20 times larger than its original starting point. The Stamps catalog, one of the most well-entrenched collectible fields on Colnect, has gone from 90,000 stamps to almost 450,000, adding close to 400,000 Stamps at that time period. The Colnect Phonecards catalog, then already celebrating a respectable size of 222,222 Phonecards has now almost tripled its size into over half a million phonecards and the largest Phonecards catalog in the world. This is not to mention (just for the lack of space in the article, since they do deserve a mention) all the new categories that have been added since then.

Emblem, Mother and Child Vietnam Stamp, newly added on Colnect

Growing catalogs mean a growing community. Colnect's collectors, then a lucky group of 8.888 members, has ballooned tenfold and will soon complete the even luckier 80,000 collectors. Colnect has then been experiencing 111,111 visits a month which are now more than 666,666 visits made by almost 400,000 users a month. The site rating given by the most reliable source available on website ranking, Alexa, has gone up from 83,000 to 36,485, climbing almost 50,000 steps up the competitive online ladder, which is more logarithmic than linear.

Grøn Te med Citrus, dif flap Denmark Tea Bag, newly added on Colnect

Earl Grey, dif flap Denmark Tea Bag, newly added on Colnect

All this data may seem cold, but what must be seen is that they are the mirror that reflects the work, dedication and heart of contributors, translators, editors, coordinators, too numerous to count. Each number contains years of searches, cooperation, attention to detail, challenges overcome and achievements made. It shows how much could be created when a group of people, no matter from which country, sex or age, gathers together over something they are truly passionate about.

Chinese Art China Phonecard, newly added on Colnect

Colnect's catalogs are meant to serve collectors wherever they may be and they do. On behalf of collectors everywhere we thank you, Colnect volunteers, for your efforts to create and perfect this invaluable source of information. This benefits all those who have or will chance upon collecting along their path.

Olivia sayana Ravenel Netherlands Antilles Stamps, newly added on Colnect

On all pages of our forum and private messages system, there now is a translation button. Click it to get an automated translation of the entire page you see.

Colnect is available is 58 languages. Translations on Colnect are maintained by volunteers. They do a fantastic job but they cannot translate our forums and your private message. The automated translations will not be as good as the manual ones but they should be good enough to help you understand simple messages.

NOTE: Mistakes do happen with automatic translation. Use simple sentences and try to repeat the meaning of what you want more than once. This will help ensure that the meaning of what you wanted is understood.

So no, we're not building a new Tower of Babel but we're now making it much easier for you to communicate with collectors from all around the world. It's now easier to make friends in more countries with Colnect.

This integration has been done directly in our phpBB forum system and the same concept can easily be applied to forums all around the Internet who would like to use it. If you run such a forum, we'll be happy to help you implement this addition.

Wishing you happy collecting :)

Colnect's rapid growth is being reflected in its presence on the world's most popular search engine, as we now have an astounding 29 million pages indexed on Google! This number is made even more impressive by the fact that only 2 months ago, Colnect's total page indicies were at only about 10% of its current tally.

However, these numbers are quite dubious, as performing a search with Google's Canadian version yields only 1.1 million search results, while others produce page indices in the range of 3-5 million. This begs the question of how exactly Google calculates the quantity of indexed pages for a particular site and if this number has any reliability or basis in reality. Although Colnect has added many new lanugages and categories over the past couple of months that would explain a certain increase in pages, a 10-fold increase over this timeframe seems entirely unrealistic. Only time will tell whether this enormous figure can sustain itself or fall back to a total more in line with past values.

In an undoubtedly positive reflection of Colnect's impressive growth and development over recent months, Colnect's page rank on Google has improved to an all-time high of 5 (PR5), besting the previous mark of 4 (PR4). With our total collectors exceeding 8,000, 10 collectible categories being made available, and constant catalog improvements, it's no wonder that more sites link to Colnect and that Colnect's status is advancing on Google.

This upgraded page rank will likely mean higher Google search results for Colnect, which should translate into greater traffic, more new registrants, and ultimately an even better collection experience for current Colnect collectors. Given Colnect's current progress, it's quite reasonable to expect future upgrades in its Page Rank as the site becomes more recognizable and well-established on the web. However, this might require extra patience, since moving to a higher Page Rank becomes progressively trickier as the levels increase.

As the number of internet users and available web pages worldwide continue to grow exponentially, the importance of maintaining a high index on search engines is magnified with it. Consequently, many spammers or special-interest groups wishing to spread a particular message have developed more sophisticated techniques for "cheating" their way to the top of search results. One of these newer methods involves exploiting sites that display a 200 Code for error messages by adding the spammer's unsolicited content into Google's (or another search engine's) indices for the purpose of generating traffic the next time a user searches for that term. For more details about this technique, see this previous blog post on the subject.

However, a more interesting phenomenon is the recent adoption of this method by political organizations and other non-commercial action groups. For example, typing the phrase "Save Us From Berlusconi" (see Image 1) into Google generates countless results in this fashion, evidently a result of the efforts made by individuals and organizations opposed to the Italian Prime Minister to get their message across.

This was brought to our attention after the messages appeared in the search engine indices for the site Transposh. Similarly, these indexed pages can also appear even without a specific search being carried out for them (see Image 2), a trend that has been noticed by the Colnect administrator who reported the problem originally.

This relatively recent spamming method has the potential to undermine the legitimacy of search engine results and consequently make some users think twice before clicking on a link that appears at the top of their results list. Google and the other major search engines need to put a halt to this problem before it becomes even more prevalent and completely compromises the integrity of their search functions.

It's not uncommon to see weird requests coming to my server at Colnect but I found this one interesting since it came from GoogleBot, the bot used by Google to index the web for its search engine.

The request made by the bot was for the URL:
/warning_this_is_english_domain_to_solve_this_problem_submit_site_in_atoall.com.html

Needless to say, this URL never existed on my domain. Seeing the actual page of atoall . com, having the title "Hot girls pictures free games boys images local news all", made me suspect spamming.

Searching for this URL on Google currently gets 106,000 results for warning_this_is_english_domain_to_solve_this_problem_submit_site_in_atoall.com.html.
which means that Google has indexed that many pages which don't really exist on the other domains. Some very well known domains have this page URL indexed on Google.

How does it happen?

Well, some sites are configured to never return a proper 404 code to let bots and people know the page is not found on their server. They prefer returning a 200 code that tells bots and browsers the page is found. The page's content, displayed to the user, indicates that what the user was looking for was never found. Most users would never know the difference between getting a 404 or 200 code.

So why do they generate a 200 code?

Well, it makes search bots, like Google, index a page that has content which was searched by a user. The next time a user would search for the same term on a search engine, there is a chance that he'll get to their page. Also, as some plug-ins to browsers can "steal" 404 pages by replacing them with their own custom results, returning a 200 code prevents it.

Why shouldn't they generate a 200 code?

The downside of returning such pages is the obvious spamming by sites such as atoall . com and others which seek illegitimate sources of traffic. According to Alexa, the site has been gaining traffic since August and it wouldn't come as a surprised if this unique form of spamming Google's search engine has a lot to do with it.

Another issue is that the search engine may choose to penalize sites which return the wrong results. The search engine can easily know if that is the case by requesting randomly generated page URLs.

So now my only question is: how come Google didn't already penalize atoall . com and removed it from their search results?

As Colnect is using Google Analytics to measure our traffic, we're happy to learn about the change to their tracking script. Announced 2 days ago and now implemented on Colnect, the script will now be loaded asynchronously and thus not block other page elements from loading. This should results is slightly faster load times and improve user experience on the site.

So now the question is when such asynchronous code be available for AdSense? I see no reason why the ads shouldn't load only when the page has been rendered.

Colnect has reached a new milestone today by eclipsing the 7,000 collectors mark! This lucky milestone X 1000 indicates that Colnect continues to grow and reach new heights. At this rate, it won't be long before Colnect challenges E-Bay as the web's most popular collectibles site.

Remember: You can help Colnect grow while also earning money for yourself through our affiliates program.

We're also happy to let you know that Colnect has over 3.4 million pages indexed on Google. As more collectors on Colnect help out improving Colnect's catalogs by adding items, the amount of pages offered by Colnect is growing quickly.

Happy collecting! :)

Google sitelinks are links shown under some sites in a search results page.

Colnect now finally has sitelinks as the following image shows:

To see how it looks now, simply Google: Colnect.

They are supposed to be somewhat helpful in convincing users to click on one of your results and so that's good news for Colnect. I'd like to accredit it to some recent SEO efforts I've made but the truth is one never really knows...

The odd thing is how these links are chosen by Google's "automatic" process. The words chosen for quicklinks are "Stamps", "Phonecards", "Countries" and "Monedas".
The first two seem a perfect choice as they frequently appear on Colnect and are really a part of what the site is all about. The last two are quite a weird twist of Google's algorithm as "Countries" actually refers to the Banknote catalog - countries list on Colnect and the second one goes to the Spanish version of the coins page, rather than the English one.

Any tips on how their "automatic" algorithm works and how I could optimize for it would be appreciated. Until then, I am, as most, at the mercy of the Google gods.

In a previous post about what beta means, I've shared my opinion about Google abusing the BETA mark as to render it almost meaningless, as their example was followed by many other companies in the industry. I've decided that Colnect will not follow and remove its BETA mark as the system became widely used by real users.

Now it seems that Google decided to follow Colnect's way and remove the BETA: "We're taking the beta label off of Gmail, Google Calendar, Google Docs and Google Talk to remove any doubt that Apps is a mature product suite.".

Yes Google, it's about time!

To those who didn't follow the humor, I doubt it that the post on Colnect's blog made the difference to Google but it's funnier to think that it did :)

Google has announced Page Speed, a FireFox add-on that will help optimizing your website. Obviously competing with Yahoo's YSlow tool, it's worth it to check out what Google think their do better and help further optimizing Colnect Collectors Community.

A funny first thing I've noticed was the comments Page Speed gave on its own guide page. Maybe it's time for Google to start using their tool ;)

Do you have a website and using both AdSense and AdWords? If so, this post will interest you. You may start by looking at the attached picture.

AdWords and AdSense are the different sides of the same coin and Google is in the middle, biting on the metal. AdSense allows website owners to show different ads from different advertisers on their website and receive revenues for it. AdWords allow you to advertise your website on many other websites running AdSense. Google takes its commission for providing both these services. Though many other competing programs exist on the web, Google's generally have good reputation.

Advertising my site on my site?!?!?!

This morning, I was amazed to see an ad to my website for collectors on the same site! Had I clicked this ad, my AdWords account would have been billed for the click while my AdSense account would have received some revenue. Obviously the latter would be lesser since Google get their cut. This is, of course, ridiculous and should be automatically prevented by Google. Apparently, it isn't. Just to ensure you that I'm not an idiot, the ad is for colnect.com and the website is on the same domain - colnect.com

I usually view my own site with ads disabled so I don't accidentally click an ad and violate the terms of service. This time I was at a friend's house and am quite happy to have stumbled upon the aforementioned bug/feature.

Quick Remedy

Both accounts allow you to use filters and I've now added "competition filters" for my AdSense account that will prevent showing ads for any of my domain. It's also possible to filter out sites on AdWords but this can seemingly be done only on the campaign level so if you're running many different campaigns, it'll become tedious. See the attached pictures.

Though the event seemed to be organized quite well, there were network problems occurring which were the reason for me leaving the gathering earlier. At some stage I've got a warning on my browser about invalid certificates for Google.com and naturally didn't decide to accept the invalid certificates. I've told others around me that someone has probably been hacking the network but seemed most people were reluctant to believe it.

Turns out I was right, here's the email sent form "The Google Developer Day Team":

Developer Day
3 de noviembre de 2008 11:02
PLEASE READ: Unauthorized network activity at Google Developer Day

Dear attendee,

First of all thanks for attending Google Developer Day yesterday, we hope you found it useful. Unfortunately, we need to let you know about an incident which took place during the conference which you may need to take precautionary action on.

We identified unauthorised activity on the public wired Ethernet network which was provided by the convention centre for conference attendees to access the Internet. This may have affected a limited number of attendees accessing websites and online applications through the wired Ethernet connection. We have no evidence so far to suggest that the wireless network also provided at the event, and which was used by most attendees, was affected.

Due to the unauthorised activity, there is a chance that if you used the wired network, any user name and password entered to access a website may have been put at risk. When trying to access a secure website (a website using https), you may have received an alert indicating that the page had an invalid security certificate. In any case, we advise users as a precaution to change the passwords for any websites or services they accessed through the wired connection during the conference.

We're really sorry that this has happened but we believe that the vast majority of attendees won't have been affected by this incident. In the meantime, we look forward to seeing you at future events very soon.

The Google Developer Day Team

Google's Developers Day 2008 tour has landed in Israel today. Google is looking for developer love and is doing their best to attract it. The main message coming from Google is "Let's work together to make the Internet a better place. The better it is, the better we're all off". Not forgetting that we're talking about a commercial company, not a charity foundation, their approach is, at least on the surface, quite amiable.

Perhaps the most interesting parts of the day, in regard to Colnect, were related to OpenSocial. From their website: "OpenSocial defines a common API for social applications across multiple websites. Built from standard JavaScript and HTML, developers can create apps with OpenSocial that access a social network's friends and update feeds. By using a common API, developers can extend the reach of their applications more quickly, yielding more functionality for users."

What does it really mean? As I see it, FaceBook has taken too big a share of the social networks world than Google (and others) think they deserve. One of the main reasons for the success of FaceBook is the FaceBook applications written by developers and extending FaceBook's functionality in many different directions. The problem is that developers are mostly unable to devote many efforts to writing their social applications to all social networks out there and would thus focus on the biggest ones. Orkut (Google's social network) is surely not the biggest and so the way to convince developers that they should write Orkut-complaint applications, we now have OpenSocial.

OpenSocial supports other networks as well which altogether (according to their figures) serve nearly 500 million users worldwide. Though I'm skeptical of how someone knows my Linked In and Orkut accounts are of the same person (they probably don't share email addresses around), it's still quite a big figure which should be a motivation enough for a developer to focus on it instead (or in addition to) FaceBook.

Colnect is about to release some social applications to the social networks world to assist collectors integrate their collectibles hobby with their other activities and help spread the word about Colnect to fellow collectors. OpenSearch seems an appealing choice since it involves many networks which cater to different crowds which altogether might coincide better with Colnect's target crowd than FaceBook.

Last but not least, there's the issue of Chrome, Google's new browser. While promoting it as simply a means to make your surfing better, it would still have been a bit more amiable and transparent to say "we've made our own browser so that we can make Google search the default search engine and not allow you to block AdWords ads with some addon". If Google really would have cared just for the web users community, it would have simply put more efforts into the existing open source browsers. In my experience, Chrome is still much inferior to FireFox.

More than a month ago, Colnect's PageRank has changed to PR4. Now's the time for some statistics taken directly from Google Analytics deployed on Colnect.

Comparing the last 2 weeks with the 2 weeks before the change show 25% more traffic from Google. But what's more interesting is that there's 68% more traffic from Live and 58% more from Yahoo. So the PageRank probably did make a difference but is Yahoo and Live taking their information from Google? Perhaps it was vice versa and I just never stumbled upon tools to test my ranking with these search engines due to the lesser amount of traffic they bring.

Forgive me Google for I have sinned. I don't know the nature of my sin yet but I'll probably eventually figure out what has happened and how come my website, full of original content (it's the world's most extensive resource for collectible phone cards information), has a PageRank of 0, aka PR0.

Seems like the lovely WWW is made of a myriad of rules that dynamically change and are quite hard to obey, even if you're one of the good guys. Unlike the real world where you usually know what's good and what's bad, here you have to figure it out slowly, understanding that causality isn't as evident.

PR0 is considered a punishment but since Google work in mysterious ways (and keep their algorithms confidential), not even a prayer could work here. One has to search deep in his heart (and on the Internet) what it is that was wrongfully done.

So here's my guess
At the end of March, I've moved to a new hosting service and decided to also move to the new domain and so http://telecards.islands.co.il became http://telecards.colnect.com

Since colnect.com (the project which will soon accommodate many different types of collectibles) is still under development I've made redirects from http://colnect.com, http://colnect.org, http://colnect.info and http://colnect.net, as well as from the old http://telecards.islands.co.il

And so, it's possible that this is considered a domain Spam (when one has a few domain that link to each other to get better search engines ranking) although no bad intentions were involved nor any trickery.

But the site's still on top of the results
Strangely enough, although my website was seemingly punished, it's still on top on the results page when Googling for telecards.

There will be more posts on SEO, SERP, Google. Meanwhile you can check out these tools for measuring your website.

Tuesday, August 26, 2014

Google Indexed More Than 245 MILLION Colnect Pages

Monday, November 21, 2011

Automatic Translation of Forums and Private Messages (phpBB)

Wednesday, January 20, 2010

29 Million Colnect Pages Indexed on Google?

Monday, January 11, 2010

Colnect's Google Page Ranks Advances to 5

Wednesday, December 23, 2009

Save Us From Index Spamming / How to Make Google Carry Your Slogan

Saturday, December 5, 2009

New SPAM technique? "warning_this_is_english_domain_to_solve_this_problem_submit_site_in_atoall.com.html"

How does it happen?

So why do they generate a 200 code?

Why shouldn't they generate a 200 code?

Thursday, December 3, 2009

Google Analytics Asynchronous Tracking

Monday, November 16, 2009

3,440,000 Pages Indexed on Google - 7,000 Collectors Are Members

Wednesday, August 12, 2009

Colnect has sitelinks on Google

Wednesday, July 8, 2009

Google Follows Colnect and Takes off the BETA

Friday, June 5, 2009

Google Page Speed vs. Yahoo YSlow

Thursday, February 19, 2009

BE CAREFUL when using both AdSense + AdWords by Google

Advertising my site on my site?!?!?!

Quick Remedy

Monday, November 3, 2008

Who hacked the network on Google Developers Day 2008 in Israel?

Sunday, November 2, 2008

Google Developers Day 2008 in Israel

Thursday, September 4, 2008

How traffic changed from PR0 to PR4

Monday, May 12, 2008

PR0

Link and Search