Header

« The case of the trademark and the purple monkey | Home | This blog is bullshit… SEO style »

SEO Toxin: Directory-like URI Structures

By Sebastian | February 2, 2010

SEO Toxin: Directory like URI StructuresRant by Sebastian

Last time I looked, search engines evaluated links when drawing site hierarchies and link graphs. They give a dead rat’s ass if your fucking URIs match utterly meaningless file system structures. URIs are totally independent of OS restrictions, hierarchies, and brain farts as well.

So why do so many “SEOs” out there still advise their clients to maintain URIs within a (pseudo) file system hierarchy? Because someone replaced their brain with a pile of bullshit. Or pseudo-PageRank. Or both. Or other disgusting crap.

Well deserved insults left aside – here comes the academic explanation. It’s all Google’s fault. Back in the good old days when Google was dancing monthly, they had a faulty algo in their toolbar that kinda “guessed” PageRank, partly based on directory structures. It displayed a green pixels width (of upper directory) minus one or two pixels for URIs without an entry in the propagated toolbar PageRank database.

If an indexed page example.com/holy/index.html showed 4 green pixels, a not yet indexed page like example.com/holy/crap.html sometimes showed 3 green pixels.

Of course PageRank isn’t measured in green pixels. Savvy SEOs knew that toolbar PageRank is for entertainment purposes only, and that guessed toolbar PageRank means less than nothing with regard to free search engine traffic. Unfortunately, not all webmasters and SEOs are savvy. Actually, most aren’t, thanks to stultifying webmaster hangouts and the SEO blogosphere.

Many clueless SEO clowns chasing toolbar PageRank begun to serve all their crap from the root directory. Until the next SEO myth came out. Well, breadcrumb navigation is a great thing to do, if it makes sense, but it doesn’t boost PageRank on “directory level” via underlying URI structures (mimicking a static file system). Not in the root, not in deeper levels. Never. PageRank is solely influenced by links and their attributes.

Changing URI structures from root-only to equally keyword stuffed directory structures resulted in useless 301 orgies, making crawling and indexing more complex for search engines. In fact, it created redirect chains like example.com/products/sku (initial version) to example.com/sku (root-only version) to example.com/category/subcategory/justanotheruselesslevel/sku.

Two more revamps (plus server name canonicalization), and search engines won’t index any product page, because five redirects in a row is the maximum. There’s no maximum when it comes to SEO myths, so probably most over-SEO’ed pages will drop out of all search indexes soon.

I love idiocy. Really, I do love it. I’m going to launch just another SEO myth like “replace slashes with backslashes because Google fell in love with IIS” and that’s four. Next up is “replace \.htm|\.php|\.html with \.asp|\.aspx because Bing gives those script name extensions more weight” and that’s five. The day after tomorrow I’ll dominate the InterWebs. Aaaahhhrrrggg…

So why is the totally flawed concept of hierachical URIs, as in file systems, so popular? Because meaningful URIs make sense for (bookmarking) users, and increase SERP CTRs. Not that a visitor cares about the hierarchies a webmaster considers logical. The opposite is true.

E.g. example.com/widgets/green/xxl is very much webmaster friendly, but meaningless to users. A visitor would appreciate example.com/green-widgets/ and prices for all sizes (XS to XXL) on this page (even better would be example.com/widgets/ where the punter can chose color as well as size). A search for [green widgets in XXL] or so delivers the desired sales pitch just as [green widgsts] (sic!) without a size.

In fact, on very few sites any hierarchy in URIs makes sense at all. If there’s no hierarchy from a user’s (current) point of view, example.com/sku (or example.com/unique-keyword-phrase or example.com/buy/unique-keyword-phrase|sku) is the best choice (for a good looking and meaningful URI). That doesn’t mean you should drop hierarchical breadcrumbs. It means that URIs have nothing to do with the structure of your vendor’s data feed, and that breadcrumbs aren’t static.

In real live, hierarchies just don’t work. Most users don’t browse Yahoo’s directory or the ODP, they perform a search. This sort of categorizing is flawed by design, because the result is always subjective and therefore not generic enough to be useful for everyone. Don’t say “but Yahoo did it too, so it can’t be that wrong”. Trust me, it’s outdated bullshit, a relict of the Internet’s early Jurassic. Just because my ancessors buried their dead bodies in trees, that doesn’t mean that I can’t have a funeral at sea.

A node (Web page) can appear at many coordinates in a webbed structure (Web site), and each coordinate can be expressed as another navigation path (breadcrumb) leading to this node. That makes the breadcrumbs transient, and you must not use transient attributes or behavior as (parts of) persistent identifiers, like URIs.

Rest assured Google can (at least will) handle dynamic breadcrumbs without losing the node’s actual context (with regard to search query relevance), so ditching on your static breadcrumb components in URIs will neither lower SERP CTRs nor uglify SERP displays. You’ll get a breadcrumb’ed SERP display even for ugly URIs like example.com/p?id=Hj8TSc&ctx=k0Oh5Ew, because the breadcrumbs are gathered from links, not from path components of URIs.

In other words: your information architecture should be based on estimated (even better: tested!) user behavior, not on your product portfolio or technical indicators. Make short and meaningful URIs for users. Provide short click-paths to all of your contents. Interlink the hell out of your page portfolio. Link for the sake of your sales and easy navigation as well, not hierarchical following alien structures like category/vendor/product/color/size.

If it makes sense, search engines will follow suit. They don’t care much about path components of URIs. A page linked from the root index page or a powerful hub will rank well regardless whether it’s URI is example.com/sku or example.com/products/sku. It’s liked by users when its URI is example.com/green-widget.

You don’t even need to provide a consistent URI pattern. Having a ton of ancient example.com/sku URIs plus fewer example.com/meaningful-string URIs for newer products as well is fine with the engines, and fine with your users. As long as your linkage is user friendly.

Think! Be creative. Don’t fall for ancient sagas.

  • Share this on del.icio.us
  • Stumble upon something good? Share it on StumbleUpon
  • Share this on Technorati
  • Share this on Mixx
  • Post this to MySpace
  • Share this on Facebook
  • Tweet This!
  • Subscribe to the comments for this post?
  • Share this on Linkedin
  • Sphinn this on Sphinn

Topics: Server-side silliness | Comments

Posted on 02 February 2010 by Sebastian

In your code fixing your bugs.

Sebastian has written 2 articles for SEO Bullshit

  • linusplus
    You know why I like reading your articles ? Because they are full of "bullshit" (the word)...

    No really, you make me smile and laugh while getting useful seo infos at the same time... just great !
  • jfitz
    The directory structure just helps with site upkeep and organization from a webmaster point of view. It's just good practice. And in turn that can make SEO tasks easier.

    But from an overall myth debunking point of view, you are right - a URI could theoretically be located anywhere, get indexed, and get ranked. Makes no difference.
  • tobto
    Many good points, Sebastian! Indeed, good linking structure is based on UI. But I would add we go to Semantical Web and our end station is the Giant Global Graph, at least RDF. Semantics is the key.
  • John F. Jones III
    Sebastian - haven't read something of yours in a long time and like in the past, I wasn't dissapointed.

    I've wrote this comment five times over already becaue I find myself agreeing and then disagreeing with some of your points in this post. I'm not sure I'll ever be able to successfully put my words on paper.

    I'll cover two things and hope I make sense:

    #1. Back in 2005 the company I work for LOVED web directories with a straight forward URI structure such as example.com/ca/Riverside-County/Corona/. We felt the Corona page itself along with the county page before it would help boost the relevancy of our customers link on the page.

    #2. Now days when I go back to visit those exact same directories with pretty URI, I find that 70% - 90% of their internal pages aren't even indexed anymore. The biggest fault IMO not being exactly the URI structure but the lack of internal linking. So as you put it.. I fully agree with your statement, "Interlink the hell out of your page portfolio".

    Well written boss!
  • Marty Martin
    Preach it brother!
  • Adam J. Humphreys
    Breadcrumbs leave a lot of room for canonical errors, and a lot more work. Siloing however does make sense as long as it's not more than 3 clicks deep for user friendly reasons. It's been suggested that the deeper the link content the less relevant Google considers the content. I however can not confirm this but Matt Cutts implied it in one of his videos. I believe what he was suggesting is the user friendly links side of things especially in reference to getting to content in 3 clicks or less. Taxonomy in relevant structure helps search engines/users to easily determine what directories are relevant to user queries. This is especially true for geolocation with /ca/ /us/ /de/. Using webmaster tools we're able to use these heirarchies to specify regional information so in certain circumstances the URL structure is important for functional reasons.

    I believe you're right about search engines changing their algorithms for overly optimized sites as they're trying more, and more to indentify more so by content rather than SEO tweaks. I think we can expect the Google slap a lot more readily in the future.

    While URLS are not especially important when considering stuffing in keywords(especially if over optimized) they are a variable of significance when it come to being user friendly. Google highlights the keyword content in URLS which indicates that it does in fact find it relevant. Google is also now highlighting synonyms more so than in the past which indicates that LSI is also a factor when considering variable expert verbiage pertinent to the content at hand. Keyword research to determine the most powerful of these synonyms with Google Wonderwheel, SERP pages, SEOmoz, and Adwords tool should give strong determining factor of what keywords to be using.

    It does not make sense to make duplicate content pages just for different keywords such as /XL/ /XXL/, as that's just setting yourself up to get wiped out by the next site that uses a single page. It does however make sense to structure your site in a user-friendly manner that helps both your users, and search engines communicate more effectively.
  • jillwhalen
    Yep...that's one of the myths in my SEO presentation of the same. It's mind boggling how many of these myths we see propagated every day. Worse yet, it can be from some very popular SEO community sites.
  • Jono Alderson
    But.. but.. the status quo! What have you done? :(
  • JohnCarcutt
    "useless 301 orgies" .. Officially the phrase of the week! Great write up Sebastian.
  • Toot
    Sebastian: No, I read the whole post. I just think your apprasial of breadcrumb naviagation is perhaps more combative than perhaps you think it looks (you are pretty harsh)

    I'm defending breadcrumb/hierarchical navigation from the point of view of the development of medium size/large sites that need definable routing patterns to display content in a logical manner. You make no mention of this in your post and how costly not developing like this can be.

    I'm not necessarily disagreeing with you. :)
  • Sebastian
    Well, breadcrumb navigation is a great thing to do, if and where it makes sense, and that can be in particular site areas or site-wide. That doesn't mean that navigation path components should become URI path components. Especially not when depending on different views on the contents many paths can lead to a piece of content, regardless whether that's obvious at the time of the first IA draft, or a possible future insight. If the implementation of flexible nodes (that don't rely on URI structures to know what to do) seems too expensive, probably a second look at the IA, the development team, and chosen instruments/tools makes sense.
  • activetraffic
    damn right, it just doesn´t matter for rankings!
    the only positive effect is (once you have good rankings), that clickrates might be higher because of "speaking" URls...
  • Toot
    SebaastianX: Routing is routing. Any developer who is asked to produce a site with urls that have no logical and easy routing structure is just going to laugh and/or charge you a lot more for what is essentially a cosmetic difference (I've been the one being laughed at...)

    Also considering Google utilises breadcrumbs in their own serps (one assumes for user benefit), ignoring them for the newest SEO craze would, in my opinion, be rather foolish.
  • Sebastian
    Toot, did you get the impression that breadcrumb navigation at all is crap? I don't agree. Prolly you've just skimmed the post.
  • Sebastian
    There may be cases where breadcrumbs in URIs make sense, no rule without exceptions applies here, too. It is, however, very unlikely. Usually we're dealing with historical contamination then.

    Breadcrumb navigation with pretty URIs that don't contain any categorizing is doable, and it's a sensible thing to do.

    As for PageRank, given it's so 2004, you can still see its bad impacts on gazillions of weak site structures "optimized" for toolbar PageRank.
  • Toot
    Eh? In websites with varied types of content, you need a formalised directory structure so you can route the web application to show the right content on the right page.

    example.com/green-widget offers no "pattern" for the web app to determine whether that page should return a product or an article (for example).

    Whereas:

    example.com/products/green-widget tells the app that the it is a product page about green widgets (the "/products/" part of the url is the pattern)

    and

    example.com/article/green-widget tells the app that the page is an article page about green widgets. (the "/article/" part of the url is the pattern)

    These forms of urls have very little to do with SEO and have everything to do with building a web app that does what it is suppoosed to.
  • SebastianX
    Toot, there are so many ways to skin a cat.
  • SEO Mofo
    You're just crabby because the nodes in my webbed structure have prettier URIs than the nodes in your webbed structure.
  • Ani Lopez
    I agree in when you say "your information architecture should be based on estimated user behavior".
    "even better: tested!" of course but if you are managing a huge site redirections game would become a nightmare so benefits would be quite difficult to measure.

    Directories like structure, while reasonable, create fantastic room for content optimization (again, specially for big sites) so I would not like people believe from this article they are not useful at all and embrace another stupid SEO fashion.

    Said that I guess we're going to enjoy this site.
  • Kevin Spence
    Great post. I especially liked your liberal use of expletives, but that's beside the point.

    Mimicking a directory structure gives an observant user a decent indication of where a page falls within the whole of your site, and if you use breadcrumbs, it gives extra internal links to your meaty 'category pages' -- this is all.

    Pagerank is so 2004. And to give you an idea of how long ago that was, I had friends then. Which says both a lot and nothing at the same time.
blog comments powered by Disqus
  • Search the Site

  • Other Shit

    • About
    • Contact Us
  • Latest Bullshit Busted

    • SEO Practitioners who sell MY Knowledge suck!
    • How You Know When Your Client Isn’t Really Interested In SEO
    • The Hit & Run Snakeoil Salesmen Are Back
    • SEO myths & crap; LSI and Google
    • Being an SEO Pisses People Off – FACT!
    • Thank you nurse but I’ll take over from here
    • Facebook Surpasses Google… But Really?
  • Crap on Tap

    • Business Bullshit
    • Myths and Crap
    • Non-SEO BS
    • Notable Nut Jobs
    • Other Stuff
    • Server-side silliness
    • Snake Oil
  • Authors

    • EricWerner (1)
    • Goosh (2)
    • Justin Parks (1)
    • JVRudnick (1)
    • Nickfb76 (1)
    • Rishi (3)
    • Sebastian (2)
    • SEO Mofo (2)
    • theGypsy (8)
    • thehodge (1)
    • tnash (1)
    Powered by Authors Widget
  • Ranting Hall of Fame

    • Aaron Wall
    • Andy Beard
    • Darren Slatten
    • Dave Naylor
    • David Harry
    • John Andrews
    • Lisa Barone
    • Marty Weintraub
    • Michael Gray
    • Michael Martinez
    • Michael VanDeMar
    • Rumble Pup
    • Sebastian X
    • Skitzzo
  • Feed the Need
  • Want to post on SEOBS?

    If you're interested in posting, get in touch!
  • On the Radar

    • yesterdays posts on SEOBS - http://bit.ly/d50aSF and http://bit.ly/cW7WhK 5 hrs ago
    • LOL RT @neyne: looking forward to the SEO industry going back to fighting and badmouthing each other. this is boring 5 hrs ago
    • yo ho fellow bullshit detectors... ready to rumble...send me any BS U come across m'kay? 5 hrs ago
    • New Post - SEO Practitioners who sell MY Knowledge suck! http://bit.ly/cW7WhK 1 day ago
    • More updates...

    Posting tweet...

    Powered by Twitter Tools

  • Stuff

    Twitter
    SEO Dojo
    TopRank Big List
    Webmaster Radio
  • SEO Bullshit | SEO BULLSHIT slammed together by No Bullshit Social Media