« | Home | »

SEO Toxin: Directory-like URI Structures

By Sebastian | February 2, 2010

SEO Toxin: Directory like URI StructuresRant by Sebastian

Last time I looked, search engines evaluated links when drawing site hierarchies and link graphs. They give a dead rat’s ass if your fucking URIs match utterly meaningless file system structures. URIs are totally independent of OS restrictions, hierarchies, and brain farts as well.

So why do so many “SEOs” out there still advise their clients to maintain URIs within a (pseudo) file system hierarchy? Because someone replaced their brain with a pile of bullshit. Or pseudo-PageRank. Or both. Or other disgusting crap.

Well deserved insults left aside – here comes the academic explanation. It’s all Google’s fault. Back in the good old days when Google was dancing monthly, they had a faulty algo in their toolbar that kinda “guessed” PageRank, partly based on directory structures. It displayed a green pixels width (of upper directory) minus one or two pixels for URIs without an entry in the propagated toolbar PageRank database.

If an indexed page example.com/holy/index.html showed 4 green pixels, a not yet indexed page like example.com/holy/crap.html sometimes showed 3 green pixels.

Of course PageRank isn’t measured in green pixels. Savvy SEOs knew that toolbar PageRank is for entertainment purposes only, and that guessed toolbar PageRank means less than nothing with regard to free search engine traffic. Unfortunately, not all webmasters and SEOs are savvy. Actually, most aren’t, thanks to stultifying webmaster hangouts and the SEO blogosphere.

Many clueless SEO clowns chasing toolbar PageRank begun to serve all their crap from the root directory. Until the next SEO myth came out. Well, breadcrumb navigation is a great thing to do, if it makes sense, but it doesn’t boost PageRank on “directory level” via underlying URI structures (mimicking a static file system). Not in the root, not in deeper levels. Never. PageRank is solely influenced by links and their attributes.

Changing URI structures from root-only to equally keyword stuffed directory structures resulted in useless 301 orgies, making crawling and indexing more complex for search engines. In fact, it created redirect chains like example.com/products/sku (initial version) to example.com/sku (root-only version) to example.com/category/subcategory/justanotheruselesslevel/sku.

Two more revamps (plus server name canonicalization), and search engines won’t index any product page, because five redirects in a row is the maximum. There’s no maximum when it comes to SEO myths, so probably most over-SEO’ed pages will drop out of all search indexes soon.

I love idiocy. Really, I do love it. I’m going to launch just another SEO myth like “replace slashes with backslashes because Google fell in love with IIS” and that’s four. Next up is “replace \.htm|\.php|\.html with \.asp|\.aspx because Bing gives those script name extensions more weight” and that’s five. The day after tomorrow I’ll dominate the InterWebs. Aaaahhhrrrggg…

So why is the totally flawed concept of hierachical URIs, as in file systems, so popular? Because meaningful URIs make sense for (bookmarking) users, and increase SERP CTRs. Not that a visitor cares about the hierarchies a webmaster considers logical. The opposite is true.

E.g. example.com/widgets/green/xxl is very much webmaster friendly, but meaningless to users. A visitor would appreciate example.com/green-widgets/ and prices for all sizes (XS to XXL) on this page (even better would be example.com/widgets/ where the punter can chose color as well as size). A search for [green widgets in XXL] or so delivers the desired sales pitch just as [green widgsts] (sic!) without a size.

In fact, on very few sites any hierarchy in URIs makes sense at all. If there’s no hierarchy from a user’s (current) point of view, example.com/sku (or example.com/unique-keyword-phrase or example.com/buy/unique-keyword-phrase|sku) is the best choice (for a good looking and meaningful URI). That doesn’t mean you should drop hierarchical breadcrumbs. It means that URIs have nothing to do with the structure of your vendor’s data feed, and that breadcrumbs aren’t static.

In real live, hierarchies just don’t work. Most users don’t browse Yahoo’s directory or the ODP, they perform a search. This sort of categorizing is flawed by design, because the result is always subjective and therefore not generic enough to be useful for everyone. Don’t say “but Yahoo did it too, so it can’t be that wrong”. Trust me, it’s outdated bullshit, a relict of the Internet’s early Jurassic. Just because my ancessors buried their dead bodies in trees, that doesn’t mean that I can’t have a funeral at sea.

A node (Web page) can appear at many coordinates in a webbed structure (Web site), and each coordinate can be expressed as another navigation path (breadcrumb) leading to this node. That makes the breadcrumbs transient, and you must not use transient attributes or behavior as (parts of) persistent identifiers, like URIs.

Rest assured Google can (at least will) handle dynamic breadcrumbs without losing the node’s actual context (with regard to search query relevance), so ditching on your static breadcrumb components in URIs will neither lower SERP CTRs nor uglify SERP displays. You’ll get a breadcrumb’ed SERP display even for ugly URIs like example.com/p?id=Hj8TSc&ctx=k0Oh5Ew, because the breadcrumbs are gathered from links, not from path components of URIs.

In other words: your information architecture should be based on estimated (even better: tested!) user behavior, not on your product portfolio or technical indicators. Make short and meaningful URIs for users. Provide short click-paths to all of your contents. Interlink the hell out of your page portfolio. Link for the sake of your sales and easy navigation as well, not hierarchical following alien structures like category/vendor/product/color/size.

If it makes sense, search engines will follow suit. They don’t care much about path components of URIs. A page linked from the root index page or a powerful hub will rank well regardless whether it’s URI is example.com/sku or example.com/products/sku. It’s liked by users when its URI is example.com/green-widget.

You don’t even need to provide a consistent URI pattern. Having a ton of ancient example.com/sku URIs plus fewer example.com/meaningful-string URIs for newer products as well is fine with the engines, and fine with your users. As long as your linkage is user friendly.

Think! Be creative. Don’t fall for ancient sagas.

No Bullshit? Then just Share it!
  • SEO Toxin: Directory like URI Structures

Topics: Server-side silliness | 46 Comments »

  • http://twitter.com/kevinmspence Kevin Spence

    Great post. I especially liked your liberal use of expletives, but that’s beside the point.

    Mimicking a directory structure gives an observant user a decent indication of where a page falls within the whole of your site, and if you use breadcrumbs, it gives extra internal links to your meaty ‘category pages’ — this is all.

    Pagerank is so 2004. And to give you an idea of how long ago that was, I had friends then. Which says both a lot and nothing at the same time.

  • http://twitter.com/kevinmspence Kevin Spence

    Great post. I especially liked your liberal use of expletives, but that’s beside the point.

    Mimicking a directory structure gives an observant user a decent indication of where a page falls within the whole of your site, and if you use breadcrumbs, it gives extra internal links to your meaty ‘category pages’ — this is all.

    Pagerank is so 2004. And to give you an idea of how long ago that was, I had friends then. Which says both a lot and nothing at the same time.

  • http://dynamical.biz/blog/ Ani Lopez

    I agree in when you say “your information architecture should be based on estimated user behavior”.
    “even better: tested!” of course but if you are managing a huge site redirections game would become a nightmare so benefits would be quite difficult to measure.

    Directories like structure, while reasonable, create fantastic room for content optimization (again, specially for big sites) so I would not like people believe from this article they are not useful at all and embrace another stupid SEO fashion.

    Said that I guess we’re going to enjoy this site.

  • http://dynamical.biz/blog/ Ani Lopez

    I agree in when you say “your information architecture should be based on estimated user behavior”.
    “even better: tested!” of course but if you are managing a huge site redirections game would become a nightmare so benefits would be quite difficult to measure.

    Directories like structure, while reasonable, create fantastic room for content optimization (again, specially for big sites) so I would not like people believe from this article they are not useful at all and embrace another stupid SEO fashion.

    Said that I guess we’re going to enjoy this site.

  • http://www.SEOmofo.com/ SEO Mofo

    You’re just crabby because the nodes in my webbed structure have prettier URIs than the nodes in your webbed structure.

  • http://www.SEOmofo.com SEO Mofo

    You’re just crabby because the nodes in my webbed structure have prettier URIs than the nodes in your webbed structure.

  • Toot

    Eh? In websites with varied types of content, you need a formalised directory structure so you can route the web application to show the right content on the right page.

    example.com/green-widget offers no “pattern” for the web app to determine whether that page should return a product or an article (for example).

    Whereas:

    example.com/products/green-widget tells the app that the it is a product page about green widgets (the “/products/” part of the url is the pattern)

    and

    example.com/article/green-widget tells the app that the page is an article page about green widgets. (the “/article/” part of the url is the pattern)

    These forms of urls have very little to do with SEO and have everything to do with building a web app that does what it is suppoosed to.

  • Toot

    Eh? In websites with varied types of content, you need a formalised directory structure so you can route the web application to show the right content on the right page.

    example.com/green-widget offers no “pattern” for the web app to determine whether that page should return a product or an article (for example).

    Whereas:

    example.com/products/green-widget tells the app that the it is a product page about green widgets (the “/products/” part of the url is the pattern)

    and

    example.com/article/green-widget tells the app that the page is an article page about green widgets. (the “/article/” part of the url is the pattern)

    These forms of urls have very little to do with SEO and have everything to do with building a web app that does what it is suppoosed to.

  • http://www.seodeluxe.de Seodeluxe

    Hey buddy, you wrote: “…and search engines won’t index any product page, because five redirects in a row is the maximum.” Due to my own tests I found out that seven is the maximum for Google, but five are far enough I think :-)
    Greetings from Germany,
    Marcell

  • http://sebastians-pamphlets.com/ Sebastian

    There may be cases where breadcrumbs in URIs make sense, no rule without exceptions applies here, too. It is, however, very unlikely. Usually we’re dealing with historical contamination then.

    Breadcrumb navigation with pretty URIs that don’t contain any categorizing is doable, and it’s a sensible thing to do.

    As for PageRank, given it’s so 2004, you can still see its bad impacts on gazillions of weak site structures “optimized” for toolbar PageRank.

  • http://sebastians-pamphlets.com/ Sebastian

    There may be cases where breadcrumbs in URIs make sense, no rule without exceptions applies here, too. It is, however, very unlikely. Usually we’re dealing with historical contamination then.

    Breadcrumb navigation with pretty URIs that don’t contain any categorizing is doable, and it’s a sensible thing to do.

    As for PageRank, given it’s so 2004, you can still see its bad impacts on gazillions of weak site structures “optimized” for toolbar PageRank.

  • http://sebastians-pamphlets.com/ SebastianX

    Toot, there are so many ways to skin a cat.

  • http://sebastians-pamphlets.com/ SebastianX

    Toot, there are so many ways to skin a cat.

  • Toot

    SebaastianX: Routing is routing. Any developer who is asked to produce a site with urls that have no logical and easy routing structure is just going to laugh and/or charge you a lot more for what is essentially a cosmetic difference (I’ve been the one being laughed at…)

    Also considering Google utilises breadcrumbs in their own serps (one assumes for user benefit), ignoring them for the newest SEO craze would, in my opinion, be rather foolish.

  • Toot

    SebaastianX: Routing is routing. Any developer who is asked to produce a site with urls that have no logical and easy routing structure is just going to laugh and/or charge you a lot more for what is essentially a cosmetic difference (I’ve been the one being laughed at…)

    Also considering Google utilises breadcrumbs in their own serps (one assumes for user benefit), ignoring them for the newest SEO craze would, in my opinion, be rather foolish.

  • http://www.activetraffic.de/ activetraffic

    damn right, it just doesn´t matter for rankings!
    the only positive effect is (once you have good rankings), that clickrates might be higher because of “speaking” URls…

  • http://www.activetraffic.de activetraffic

    damn right, it just doesn´t matter for rankings!
    the only positive effect is (once you have good rankings), that clickrates might be higher because of “speaking” URls…

  • http://sebastians-pamphlets.com/ Sebastian

    Toot, did you get the impression that breadcrumb navigation at all is crap? I don’t agree. Prolly you’ve just skimmed the post.

  • http://sebastians-pamphlets.com/ Sebastian

    Toot, did you get the impression that breadcrumb navigation at all is crap? I don’t agree. Prolly you’ve just skimmed the post.

  • Toot

    Sebastian: No, I read the whole post. I just think your apprasial of breadcrumb naviagation is perhaps more combative than perhaps you think it looks (you are pretty harsh)

    I’m defending breadcrumb/hierarchical navigation from the point of view of the development of medium size/large sites that need definable routing patterns to display content in a logical manner. You make no mention of this in your post and how costly not developing like this can be.

    I’m not necessarily disagreeing with you. :)

  • Toot

    Sebastian: No, I read the whole post. I just think your apprasial of breadcrumb naviagation is perhaps more combative than perhaps you think it looks (you are pretty harsh)

    I’m defending breadcrumb/hierarchical navigation from the point of view of the development of medium size/large sites that need definable routing patterns to display content in a logical manner. You make no mention of this in your post and how costly not developing like this can be.

    I’m not necessarily disagreeing with you. :)

  • http://sebastians-pamphlets.com/ Sebastian

    Well, breadcrumb navigation is a great thing to do, if and where it makes sense, and that can be in particular site areas or site-wide. That doesn’t mean that navigation path components should become URI path components. Especially not when depending on different views on the contents many paths can lead to a piece of content, regardless whether that’s obvious at the time of the first IA draft, or a possible future insight. If the implementation of flexible nodes (that don’t rely on URI structures to know what to do) seems too expensive, probably a second look at the IA, the development team, and chosen instruments/tools makes sense.

  • http://sebastians-pamphlets.com/ Sebastian

    Well, breadcrumb navigation is a great thing to do, if and where it makes sense, and that can be in particular site areas or site-wide. That doesn’t mean that navigation path components should become URI path components. Especially not when depending on different views on the contents many paths can lead to a piece of content, regardless whether that’s obvious at the time of the first IA draft, or a possible future insight. If the implementation of flexible nodes (that don’t rely on URI structures to know what to do) seems too expensive, probably a second look at the IA, the development team, and chosen instruments/tools makes sense.

  • http://sebastians-pamphlets.com/ Sebastian

    Google follows more than 5 redirects, but Bing for example stops at the 5th redirect in a row. More info at w3.org.

  • http://www.appliedseo.com JohnCarcutt

    “useless 301 orgies” .. Officially the phrase of the week! Great write up Sebastian.

  • http://www.twentysixsearch.com/ Jono Alderson

    But.. but.. the status quo! What have you done? :(

  • jillwhalen

    Yep…that's one of the myths in my SEO presentation of the same. It's mind boggling how many of these myths we see propagated every day. Worse yet, it can be from some very popular SEO community sites.

  • adamjhumphreys

    Breadcrumbs leave a lot of room for canonical errors, and a lot more work. Siloing however does make sense as long as it's not more than 3 clicks deep for user friendly reasons. It's been suggested that the deeper the link content the less relevant Google considers the content. I however can not confirm this but Matt Cutts implied it in one of his videos. I believe what he was suggesting is the user friendly links side of things especially in reference to getting to content in 3 clicks or less. Taxonomy in relevant structure helps search engines/users to easily determine what directories are relevant to user queries. This is especially true for geolocation with /ca/ /us/ /de/. Using webmaster tools we're able to use these heirarchies to specify regional information so in certain circumstances the URL structure is important for functional reasons.

    I believe you're right about search engines changing their algorithms for overly optimized sites as they're trying more, and more to indentify more so by content rather than SEO tweaks. I think we can expect the Google slap a lot more readily in the future.

    While URLS are not especially important when considering stuffing in keywords(especially if over optimized) they are a variable of significance when it come to being user friendly. Google highlights the keyword content in URLS which indicates that it does in fact find it relevant. Google is also now highlighting synonyms more so than in the past which indicates that LSI is also a factor when considering variable expert verbiage pertinent to the content at hand. Keyword research to determine the most powerful of these synonyms with Google Wonderwheel, SERP pages, SEOmoz, and Adwords tool should give strong determining factor of what keywords to be using.

    It does not make sense to make duplicate content pages just for different keywords such as /XL/ /XXL/, as that's just setting yourself up to get wiped out by the next site that uses a single page. It does however make sense to structure your site in a user-friendly manner that helps both your users, and search engines communicate more effectively.

  • http://twitter.com/mosquitohawk Marty Martin

    Preach it brother!

  • John F. Jones III

    Sebastian – haven't read something of yours in a long time and like in the past, I wasn't dissapointed.

    I've wrote this comment five times over already becaue I find myself agreeing and then disagreeing with some of your points in this post. I'm not sure I'll ever be able to successfully put my words on paper.

    I'll cover two things and hope I make sense:

    #1. Back in 2005 the company I work for LOVED web directories with a straight forward URI structure such as example.com/ca/Riverside-County/Corona/. We felt the Corona page itself along with the county page before it would help boost the relevancy of our customers link on the page.

    #2. Now days when I go back to visit those exact same directories with pretty URI, I find that 70% – 90% of their internal pages aren't even indexed anymore. The biggest fault IMO not being exactly the URI structure but the lack of internal linking. So as you put it.. I fully agree with your statement, “Interlink the hell out of your page portfolio”.

    Well written boss!

  • http://tobto.org/ tobto

    Many good points, Sebastian! Indeed, good linking structure is based on UI. But I would add we go to Semantical Web and our end station is the Giant Global Graph, at least RDF. Semantics is the key.

  • Pingback: SEO Bullshit: Mimicking a file system in URIs

  • jfitz

    The directory structure just helps with site upkeep and organization from a webmaster point of view. It's just good practice. And in turn that can make SEO tasks easier.

    But from an overall myth debunking point of view, you are right – a URI could theoretically be located anywhere, get indexed, and get ranked. Makes no difference.

  • Pingback: Weekly Search & Social News: 02/09/2010 | Search Engine Journal

  • Pingback: Is Wordpress Good of Bad for SEO

  • linusplus

    You know why I like reading your articles ? Because they are full of “bullshit” (the word)…

    No really, you make me smile and laugh while getting useful seo infos at the same time… just great !

  • seanruiz

    I am not a fan of the MVC style URIs. I understand that it's necessary but I also think it's lazy. I'd like to see a Web application that handled URIs with the same intelligence a Web server handled URIs, especially if nearly all of the URIs are going to be routed through one central location.

  • andro

    Thanks for this info. It might differ with many SEOs whom r seeking to increase their on site otpimization .

  • http://eurowonder.com/shampoos.html shampoos

    Thanks for pointing this out, I agree totally~John online logo design

  • JonahClint

    This is a great books collection. Sadly, I don't have enough time to read as much as I'd like to anymore but a few titles really caught my attention.

  • http://www.globalseoconsultants.com/ Global SEO Consultants

    URI is totally new concept for us.Thank for sharing

  • http://www.advanceddatarecovery.co.uk data recovery

    I also liked the post very much.

  • icodeforlove

    lol new concept?

  • JimmieBlake

    Yeah, ugly job

  • http://www.optimierungsuchmaschinen.net SEO Anton

    Thanks Man! Very clear language and academic overview – usually SEOs just post a bunch of unuseful crap but your stuff is great, direct and matches the facts! Cheers, Anton

  • http://security-wire.com/03/how-to-remove-windows-diagnostic-fake-security-program.html remove windows diagnostic

    Nice post there. Raised a few things I hadn’t thought about before. Thx.