Click to Play

Dealing with Load Balancing Issues
As most everyone knows, large and complex websites require a lot of hosting systems. Unfortunately, a problem occurs since many hosting systems create...

Web News

Setting Up Your Site For The First Time
All right beginners pay attention and I'll instruct you "How To Set Up A Web Page Free"!!! There are a lot of sites out there that make you believe that they are going...

Increase Your Site SEO With Chrome Extension
Ever since Google has allowed Chrome Extensions, a number of interesting SEO tools have been popping up. These are now beginning to rival some of the of the toolbars and add-ons available for Firefox...

Testing Your Site's SEO With The SEO Report Card
Google has issued what they are calling an SEO Report Card, an effort to provide Google's product teams with ideas on how they can improve their products' pages using simple and accepted optimizations.

Using Open Source To Reach A Broader User-Base
Over the last weeks Alfresco, Sonatype and WaveMaker made their own decisions about licensing. Alfresco went LGPL, Sonatype - a company with a strong...

Finding The Real Truths About SOA And Cloud Computing
I had the pleasure of presenting at Utah St.'s 26th Annual IT Seminar yesterday. The conference was planned and run by the students. I was impressed both by the...


05.06.10

Archiving Your Content Rich Sites

By Michael Gray

Today's post is an answer to a question I took a few weeks ago: How to organize archives/categories on Wordpress for news site/blog that publishes a lot of articles, around 30-50 daily.

The first thing I want to bring up is that generally, if you're publishing that volume of posts per day, your posts probably are date sensitive, so you don't mind Google attaching a date to your posts. You can include the date in your URL structure, but it's not necessary. However, you do want to have date archives available and close to the top. As an example, in the top of my masthead, you'll see a link to the archive pagewhich has a link to every month I have published a post. So if you wanted to reach a post I wrote, the path is 4 levels deep
Home> Archives> October 2005> Bacon Polenta

This is what people mean when they mention a flat site architecture or crawling path: you aren't more than 4 clicks from any other post. You also want to make sure you have your robots.txt and robots meta tags configured properly to allow the spiders to crawl that path. If you are publishing a very high volume of posts/pages, you're going to want to get as many links on the archive page as possible, without becoming excessive. Google recommends no more than 100 links per page. In reality that number is really affected by the trust and authority of your inbound links or your link equity.

If you aren't familiar with the term link equity you should read this article by Eric Ward on link equity. Basically, your site's link equity will determine your crawling depth and crawling frequency. The more links you have and the stronger those links are, the deeper the search engines will crawl and the more frequently they will re-crawl it. This is a difficult problem for new websites: they need to add content, but if they add too much too soon, it won't get crawled. So new sites need to balance content creation with link building.


Some other tools you can use to help flatten out a website and increase crawling depth are breadcrumbs. Joost De Valk makes an excellent breadcrumb pluginfor Wordpress websites. A related posts plugin that changes the related posts at the end of each post will also help. I like yet another related posts plugin. Also make sure you are generating an HTML sitemapI like the dagon sitemap generator.

So, to wrap things up, here's what you want to do:

• Provide one or more short crawling paths for spiders. Try to keep the path as short as possible because anything beyond 5 levels is a problem.

• Make sure your robots.txt and robots meta tags don't accidentally block the crawling path.

• Provide alternate crawling paths or access to the website content with breadcrumbs, related posts, and sitemaps.

• Try to minimize maintenanceand make sure all or as many of these solutions as possible update automatically.

Comments


About the author:
Michael Gray is SEO specialist and publishes a Search Engine Industry blog at www.Wolf-Howl.com. He has over 10 years experience in website development and internet marketing, helping both small and large companies increase their search engine visibility, traffic, and sales. Michael is a current member of Internet Marketing of New York ( IM-NY.org) and a guest speaker on Webmaster Radio. He is also an editor for the popular search engine new website Threadwatch.org.
About DevWebProUK
DevWebProUK is for professional developers ... those who build and manage applications and sophisticated websites. DevWebProUK delivers via news and expert advice New Strategies In Development.





DevWebProUK is brought to you by:

SecurityConfig.com NetworkingFiles.com
NetworkNewz.com WebProASP.com
DatabaseProNews.com SQLProNews.com
ITcertificationNews.com SysAdminNews.com
LinuxProNews.com WirelessProNews.com
CProgrammingTrends.com ITmanagementNews.com






-- DevWebProUK is an iEntry, Inc. publication --
iEntry, Inc. 2549 Richmond Rd. Lexington KY, 40509
2010 iEntry, Inc.  All Rights Reserved  Privacy Policy  Legal 

archives | advertising info | news headlines | free newsletters | comments/feedback | submit article


Delivering IT Solutions DevWebProUK News Archives About Us Feedback DevWebProCanada Home Page About Article Archive News Downloads WebProWorld Forums Jayde iEntry Advertise Contact