Tools and Tactics for Diagnosing Technical Search Issues
Vanessa Fox
 
Diagnostic Checklists and Resources
  - Search Accessibility Checklist
  - Search Discoverability Checklist
  - Diagnostic Tools
janeandrobot.com
Search Engine Tools
  - Google Webmaster Central: http://www.google.com/webmasters
  - Microsoft Live Search Webmaster Center: http://webmaster.live.com
  - Yahoo! Site Explorer: http://siteexplorer.search.yahoo.com
  - Google Analytics: http://www.google.com/analytics
  - Google Search: http://www.google.com
Created by NineByBlue.com
Ranking and Diagnostic Tools
  - SEOBook Rank Checker: http://tools.seobook.com/firefox/rank-checker/
  - Firefox Web Developer Toolbar: https://addons.mozilla.org/en-US/firefox/addon/60
  - Firefox Firebug: http://getfirebug.com/
  - Firefox Live HTTP Headers: https://addons.mozilla.org/en-US/firefox/addon/3829
  - Microsoft adCenter Labs Keyword Forecast: http://adlab.msn.com/Keyword-Forecast/default.aspx
More resources: http://janeandrobot.com/resources
How Search Engines Work

Crawling
  - Discover links
  - Check robots rules
  - Bandwidth considerations
  - URLs
Indexing
  - Canonicalization
  - Context extraction
  - Topic association
  - Web-wide value
Ranking
  - Relevance
  - Value
  - Uniqueness
  - Display
Search Engine Crawlers Haven’t Quite Grown Up Yet
Crawling
  - Lack of discovery
  - Crawl inefficiency
  - URL issues (infinite, redirects, dynamic)
  - Inaccessible links
Indexing
  - Duplication
  - Extraction issues
  - Lack of exposed content
  - Non-optimized media
Ranking
  - Display issues
  - Lack of quality links
  - Guidelines violations
  - Non-focused content
Step 1: Get the Data
  - Pages crawled
  - Pages indexed
  - Web traffic
  - Key ranking metrics
Crawling
  - Which pages have the search engines crawled?
  - What kind of pages are they?
Indexing
  - Has the search engine indexed all of the crawled pages?
Ranking
  - How's the search engine traffic?
Benchmarking
  - Top ten queries that bring search traffic
  - Search results position
  - URL that ranks
 
Crawl Issues
Crawl Log Example: Apache Log Analyzer 2 Feed
http://code.simonecarletti.com/wiki/apachelog2feed

All Pages Google's Crawled

    <?php
    /**
     * @see ApacheLogAnalyzer2Feed
     */
    require_once 'ApacheLogAnalyzer2Feed.php';

    // create a new instance, parse access.log and write test.xml
    $tool = new ApacheLogAnalyzer2Feed('access.log', 'test.xml');
    // select entries matching the Googlebot user agent
    $tool->addFilter('User-Agent', 'Mozilla/5.0 (compatible; Googlebot/2.1; +http://www.google.com/bot.html)');
    // run
    $tool->run();

All Profile Pages Google's Crawled

    <?php
    /**
     * @see ApacheLogAnalyzer2Feed
     */
    require_once 'ApacheLogAnalyzer2Feed.php';

    // create a new instance, parse access.log and write test.xml
    $tool = new ApacheLogAnalyzer2Feed('access.log', 'test.xml');
    // select entries matching the Googlebot user agent with a
    // regular expression pattern
    $tool->addFilter('User-Agent', 'regexp:Googlebot');
    // select entries with Request matching a regular expression pattern
    $tool->addFilter('Request', 'regexp:/site/profile\.php');
    // run
    $tool->run();
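The same two questions (which pages has Googlebot requested, and which of those are profile pages) can also be asked of a raw access log without the PHP tool. The sketch below is a minimal illustration, assuming the log uses the Apache combined format; the function name `crawled_requests` and the sample log lines are hypothetical, and matching on the user-agent string alone does not prove a request really came from Google.

```python
import re

# Apache combined log format (an assumption about how the server logs):
# host ident user [time] "request" status bytes "referer" "user-agent"
LOG_LINE = re.compile(
    r'^(?P<host>\S+) \S+ \S+ \[(?P<time>[^\]]+)\] '
    r'"(?P<request>[^"]*)" (?P<status>\d{3}) \S+ '
    r'"(?P<referer>[^"]*)" "(?P<agent>[^"]*)"$'
)


def crawled_requests(lines, agent_pattern=r"Googlebot", request_pattern=None):
    """Yield (request, status) for lines whose user agent matches agent_pattern.

    If request_pattern is given, the request line must match it as well.
    """
    agent_re = re.compile(agent_pattern)
    request_re = re.compile(request_pattern) if request_pattern else None
    for line in lines:
        match = LOG_LINE.match(line.strip())
        if not match:
            continue  # skip lines that are not in combined format
        if not agent_re.search(match["agent"]):
            continue  # not the crawler we are looking for
        if request_re and not request_re.search(match["request"]):
            continue  # crawler request, but not for the pages of interest
        yield match["request"], int(match["status"])


# Hypothetical sample data for illustration only.
GOOGLEBOT = "Mozilla/5.0 (compatible; Googlebot/2.1; +http://www.google.com/bot.html)"
SAMPLE_LOG = [
    f'66.249.66.1 - - [12/Jun/2009:10:00:00 -0700] "GET /site/profile.php?id=7 HTTP/1.1" 200 5120 "-" "{GOOGLEBOT}"',
    f'66.249.66.1 - - [12/Jun/2009:10:00:05 -0700] "GET /about.html HTTP/1.1" 200 2048 "-" "{GOOGLEBOT}"',
    '10.0.0.5 - - [12/Jun/2009:10:00:09 -0700] "GET /site/profile.php?id=7 HTTP/1.1" 200 5120 "-" "Mozilla/5.0 (Windows; U) Firefox/3.0"',
]

all_pages = list(crawled_requests(SAMPLE_LOG))
profile_pages = list(crawled_requests(SAMPLE_LOG, request_pattern=r"/site/profile\.php"))
```

In practice the `lines` argument would be an open file handle on the access log rather than a list.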
Communicating with Search Robots
Extractable Link Issues: Flash
Extractable Link Issues: Images
Extractable Link Issues: AJAX
Extractable Link Issues: URL Errors
Extractable Link Issues: URLs That Expire
URL Discovery Checklist
  - Comprehensive external links
  - At least one internal link to every page
  - XML Sitemap referenced in robots.txt with the comprehensive list of canonical URLs
  - Comprehensive HTML sitemap
  - Ensure links load without JavaScript, images, or other rich media
  - Ensure robots.txt and the meta robots tag are used correctly
http://janeandrobot.com/library/managing-robots-access-to-your-website
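Two items on the discovery checklist, correct robots.txt rules and a Sitemap reference in robots.txt, can be spot-checked with Python's standard `urllib.robotparser`. This is only a sketch: the robots.txt content and URLs are made-up examples, `blocked_urls` and `declared_sitemaps` are hypothetical helper names, and `site_maps()` requires Python 3.8 or later. The standard-library parser may also interpret edge cases differently than a given search engine does.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt content for illustration.
ROBOTS_TXT = """\
User-agent: *
Disallow: /print/
Disallow: /search

Sitemap: http://www.example.com/sitemap.xml
"""


def blocked_urls(robots_txt, urls, user_agent="Googlebot"):
    """Return the URLs that the given robots.txt text disallows for user_agent."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return [url for url in urls if not parser.can_fetch(user_agent, url)]


def declared_sitemaps(robots_txt):
    """Return the Sitemap URLs referenced in the robots.txt text (may be empty)."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.site_maps() or []


urls_to_check = [
    "http://www.example.com/",
    "http://www.example.com/print/page1",
    "http://www.example.com/search?q=snowboards",
]
blocked = blocked_urls(ROBOTS_TXT, urls_to_check)
sitemaps = declared_sitemaps(ROBOTS_TXT)
```

Running the check against a list of pages that should be indexed is a quick way to catch an overly broad Disallow rule.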
URL Structure Checklist
  - Keep the number of parameters in dynamic URLs low
  - Don't use temporary URLs that expire
  - Ensure redirects are 301s and redirect chains are short
  - Use dashes rather than underscores when separating words
  - Use keywords in URLs for higher click-through and better anchor text
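A couple of the URL structure items lend themselves to an automated lint. The sketch below flags URLs with many query parameters or underscores in the path; the helper name `url_structure_warnings` and the threshold of two parameters are illustrative assumptions, not limits stated in the deck.

```python
from urllib.parse import urlsplit, parse_qsl


def url_structure_warnings(url, max_params=2):
    """Return a list of human-readable warnings for one URL.

    max_params is an arbitrary illustrative threshold, not a documented limit.
    """
    parts = urlsplit(url)
    warnings = []
    params = parse_qsl(parts.query, keep_blank_values=True)
    if len(params) > max_params:
        warnings.append(f"{len(params)} query parameters (more than {max_params})")
    if "_" in parts.path:
        warnings.append("underscores in path; consider dashes")
    return warnings


clean = url_structure_warnings("http://www.example.com/snowboards/arbor-element")
messy = url_structure_warnings(
    "http://www.example.com/product_detail.php?id=7&cat=3&sort=price&page=2"
)
```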
Canonicalization Checklist
  - Have only one URL for each page
  - Put all unneeded details in cookies, rather than URLs (session IDs, tracking parameters)
  - Don't allow infinite parameters
  - Use 301 redirects for any URL changes
  - 301 redirect www/non-www
  - Use absolute URLs for internal links
  - Ensure the canonical version is in the XML Sitemap
  - Use the rel=canonical attribute for optional parameters
  - Block print and other versions with robots.txt
http://janeandrobot.com/library/url-referrer-tracking
http://searchengineland.com/canonical-tag-16537
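To make the canonicalization checklist concrete, here is a minimal sketch that maps URL variants to a single canonical form and emits the matching rel=canonical link element. The parameter names in `NON_CANONICAL_PARAMS`, the preferred host, and both function names are assumptions chosen for illustration; a real site would substitute its own session and tracking parameters.

```python
from html import escape
from urllib.parse import urlsplit, urlunsplit, parse_qsl, urlencode

# Hypothetical parameters that should not create distinct URLs.
NON_CANONICAL_PARAMS = {"sessionid", "sid", "utm_source", "utm_medium", "utm_campaign", "ref"}


def canonical_url(url, preferred_host="www.example.com"):
    """Normalize host (www vs. non-www) and drop session/tracking parameters."""
    parts = urlsplit(url)
    host = parts.netloc.lower()
    bare = preferred_host[4:] if preferred_host.startswith("www.") else preferred_host
    if host in (bare, "www." + bare):
        host = preferred_host
    kept = [
        (key, value)
        for key, value in parse_qsl(parts.query, keep_blank_values=True)
        if key.lower() not in NON_CANONICAL_PARAMS
    ]
    # Drop the fragment; it is never sent to the server.
    return urlunsplit((parts.scheme, host, parts.path or "/", urlencode(kept), ""))


def canonical_link_tag(url, preferred_host="www.example.com"):
    """Return the <link rel="canonical"> element for the page at url."""
    href = escape(canonical_url(url, preferred_host), quote=True)
    return f'<link rel="canonical" href="{href}">'


variant = "http://example.com/page1.php?sessionid=abc123&color=red"
canonical = canonical_url(variant)
tag = canonical_link_tag(variant)
```

The same mapping can drive the 301 redirects: if the requested URL differs from `canonical_url(requested)`, redirect to the canonical form.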
Crawl Efficiency Checklist
  - Ensure page load times aren't so slow as to reduce the number of pages crawled
  - Ensure the server is responsive
  - Return a 304 for unchanged content
  - Use compression
  - Return a 404 for not found content
  - Ensure each page has at least one link
  - Avoid infinite redirects and redirect loops
  - Ensure the most important pages are linked from the home page
  - No JavaScript redirects or meta refresh redirects (if possible)
  - Reasonable crawl-delay setting (if used at all)
  - Reasonable use of the Google Webmaster Tools crawl setting
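Redirect loops and long redirect chains can be detected offline once the site's redirect rules are exported as a source-to-target mapping. The sketch below assumes such a mapping is available; `follow_redirects`, the sample rules, and the five-hop limit are illustrative choices rather than documented crawler behavior.

```python
def follow_redirects(start, redirects, max_hops=5):
    """Follow redirect rules from start.

    Returns (chain, problem) where problem is None, "loop", or "too long".
    max_hops is an arbitrary illustrative limit.
    """
    chain = [start]
    seen = {start}
    current = start
    while current in redirects:
        current = redirects[current]
        chain.append(current)
        if current in seen:
            return chain, "loop"  # we came back to a URL already visited
        seen.add(current)
        if len(chain) - 1 > max_hops:
            return chain, "too long"
    return chain, None


# Hypothetical redirect rules for illustration.
REDIRECTS = {
    "/old": "/older",
    "/older": "/new",
    "/a": "/b",
    "/b": "/a",
}

ok_chain = follow_redirects("/old", REDIRECTS)
loop_chain = follow_redirects("/a", REDIRECTS)
```

A chain such as `/old` to `/older` to `/new` works, but pointing `/old` straight at `/new` saves the crawler a request.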
Indexing Issues
Indexing Example: XML Sitemaps
http://sitemaps.org

XML Sitemap

    <?xml version="1.0" encoding="UTF-8"?>
    <urlset xmlns="http://www.sitemaps.org/schemas/sitemap/0.9">
      <url>
        <loc>http://www.example.com/</loc>
      </url>
      <url>
        <loc>http://www.example.com/page1.php</loc>
      </url>
      <url>
        <loc>http://www.example.com/page2.php</loc>
      </url>
    </urlset>

Pages Indexed From Sitemap (http://www.google.com/webmasters)
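A Sitemap like the one on this slide is usually generated rather than written by hand. The following is a minimal sketch using Python's standard `xml.etree.ElementTree`; the function name `build_sitemap` and the URL list are illustrative, and only the required `<loc>` element is emitted.

```python
import xml.etree.ElementTree as ET

SITEMAP_NS = "http://www.sitemaps.org/schemas/sitemap/0.9"


def build_sitemap(urls):
    """Return a sitemaps.org-style XML document listing the given URLs."""
    urlset = ET.Element("urlset", xmlns=SITEMAP_NS)
    for url in urls:
        url_element = ET.SubElement(urlset, "url")
        # ElementTree escapes characters such as & in the text for us.
        ET.SubElement(url_element, "loc").text = url
    body = ET.tostring(urlset, encoding="unicode")
    return '<?xml version="1.0" encoding="UTF-8"?>\n' + body


sitemap_xml = build_sitemap([
    "http://www.example.com/",
    "http://www.example.com/page1.php",
    "http://www.example.com/page2.php?a=1&b=2",
])
```

Feeding the function only canonical URLs keeps the Sitemap consistent with the canonicalization checklist.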
Duplicate Content Issues
Partner Content
http://www.google.co.uk/search?q=%22The+Radisson+Edwardian+Vanderbilt+Hotel+stands+among+a+row+of+Victorian+townhouses+located+in+the+fashionable+Kensington+district+of+London,+England%22&hs=cN0&filter=0
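The URL on this slide searches for an exact sentence from a page, which is a quick way to see how many other sites carry the same partner text. A small sketch of building such a query URL follows. The helper name `exact_phrase_search_url` is hypothetical, and the reading of `filter=0` as "do not collapse near-duplicate results" is an assumption based on how that parameter has commonly been described, not something the slide states.

```python
from urllib.parse import urlencode


def exact_phrase_search_url(phrase, base="http://www.google.com/search"):
    """Build a search URL that looks for phrase as an exact quoted match."""
    query = {
        "q": f'"{phrase}"',  # quotes request an exact-phrase match
        "filter": "0",       # assumption: asks for duplicates not to be collapsed
    }
    return f"{base}?{urlencode(query)}"


search_url = exact_phrase_search_url("stands among a row of Victorian townhouses")
```

Picking a distinctive sentence from the middle of the copy tends to give cleaner results than a headline that many pages might share by coincidence.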
Indexing Diagnostic Checklist
  - Have the pages ever been indexed?
  - If deindexed, are you sure they are no longer in the index?
  - Is the indexing loss across all engines?
  - What was the percentage of loss?
  - Is there a pattern?
  - Check Google Webmaster Tools for errors/blocking
  - Did you change infrastructure/CMS/implement redirects?
  - What's the linking pattern?
Indexing Checklist: Content Extraction
  - Ensure content is in text wherever possible
  - Ensure text isn't hidden in:
      - JavaScript/AJAX
      - Flash
      - Video
      - Images
  - Avoid multiple URLs for the same page and very similar pages
Indexing Checklist: Semantic Markup
  - Use keywords in the title tag
  - Ensure each page has a unique meta description tag
  - Use keywords in a (single) H1 tag
  - Use H2-H6 tags appropriately
  - Use relevant anchor text in a href tags
  - Put JavaScript in .js files (except onclick event functions) and style details in .css files
  - Validate HTML to ensure it renders
  - Provide focus for each page
  - Ensure pages provide unique and valuable content beyond boilerplate template and reused content
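Several of the semantic markup items (a title, a meta description, a single H1) can be checked mechanically. The sketch below uses Python's standard `html.parser`; the class `MarkupAudit`, the function `audit_markup`, and the sample pages are hypothetical, and the audit only checks presence, not whether the wording is any good.

```python
from html.parser import HTMLParser


class MarkupAudit(HTMLParser):
    """Collect the title text, meta description, and number of h1 tags."""

    def __init__(self):
        super().__init__()
        self.title = ""
        self.meta_description = None
        self.h1_count = 0
        self._in_title = False

    def handle_starttag(self, tag, attrs):
        attributes = dict(attrs)
        if tag == "title":
            self._in_title = True
        elif tag == "h1":
            self.h1_count += 1
        elif tag == "meta" and (attributes.get("name") or "").lower() == "description":
            self.meta_description = attributes.get("content") or ""

    def handle_endtag(self, tag):
        if tag == "title":
            self._in_title = False

    def handle_data(self, data):
        if self._in_title:
            self.title += data


def audit_markup(html):
    """Return a list of problems found in the page markup."""
    audit = MarkupAudit()
    audit.feed(html)
    audit.close()
    problems = []
    if not audit.title.strip():
        problems.append("missing title")
    if not audit.meta_description:
        problems.append("missing meta description")
    if audit.h1_count != 1:
        problems.append(f"expected one h1, found {audit.h1_count}")
    return problems


GOOD_PAGE = (
    "<html><head><title>Arbor Snowboards</title>"
    '<meta name="description" content="Hand-built snowboards."></head>'
    "<body><h1>Arbor Snowboards</h1><h2>Models</h2></body></html>"
)
BAD_PAGE = "<html><head><title>Home</title></head><body><h1>A</h1><h1>B</h1></body></html>"

good_problems = audit_markup(GOOD_PAGE)
bad_problems = audit_markup(BAD_PAGE)
```

Checking that meta descriptions are unique across pages would need a second pass that compares the collected values site-wide.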
Optimizing Images
  - Don't put text in images
  - Use descriptive ALT text
  - Use descriptive filenames
  - Provide caption and surrounding text
  - Be cautious about logo images
  - Consider blocking non-useful images with robots.txt
  - Don't provide alternate text using CSS that styles the text off the page (such as -9999px)
http://janeandrobot.com/post/Effectively-Using-Images.aspx
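The ALT text item is easy to audit automatically. This sketch lists images whose `alt` attribute is missing or empty, again with Python's standard `html.parser`; `ImageAltAudit`, `images_missing_alt`, and the sample markup are illustrative names and data. Note that an empty `alt` is legitimate for purely decorative images, so the output is a list to review rather than a list of errors.

```python
from html.parser import HTMLParser


class ImageAltAudit(HTMLParser):
    """Record the src of every img tag that has no usable alt text."""

    def __init__(self):
        super().__init__()
        self.missing_alt = []

    def handle_starttag(self, tag, attrs):
        if tag != "img":
            return
        attributes = dict(attrs)
        if not (attributes.get("alt") or "").strip():
            self.missing_alt.append(attributes.get("src") or "")


def images_missing_alt(html):
    """Return the src values of images with missing or empty alt text."""
    audit = ImageAltAudit()
    audit.feed(html)
    audit.close()
    return audit.missing_alt


SAMPLE_HTML = (
    '<img src="/img/arbor-snowboard.jpg" alt="Arbor snowboard">'
    '<img src="/img/IMG_0042.jpg">'
    '<img src="/img/logo.png" alt="">'
)
needs_review = images_missing_alt(SAMPLE_HTML)
```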
Ranking Issues
How's the Search Engine Traffic?
  - Overall Percentage
  - Percentage Non-Branded
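The two percentages on this slide can be computed from an analytics export of visits per search query. The sketch below assumes one plausible reading: "overall" is search visits as a share of all visits, and "non-branded" is the share of search visits whose query contains no brand term. The function name `traffic_shares` and every number in the example are made up for illustration.

```python
def traffic_shares(total_visits, search_visits_by_query, brand_terms):
    """Return search traffic as a share of all visits, and the non-branded share of search."""
    search_total = sum(search_visits_by_query.values())
    non_branded = sum(
        visits
        for query, visits in search_visits_by_query.items()
        if not any(term in query.lower() for term in brand_terms)
    )
    return {
        "search_pct": 100 * search_total / total_visits,
        "non_branded_pct": 100 * non_branded / search_total if search_total else 0.0,
    }


# Hypothetical numbers for illustration only.
shares = traffic_shares(
    total_visits=1000,
    search_visits_by_query={
        "arbor snowboards": 300,
        "arbor": 100,
        "snowboard": 60,
        "snowboards": 40,
    },
    brand_terms=["arbor"],
)
```

A low non-branded share suggests the site is mostly found by people who already know the brand name.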
Do You Rank For the Right Things?

                 arbor   snowboards   snowboard
  Google           1         49         500+
  Yahoo            1         80         500+
  Live Search      3        128         500+
If ranking loss…

Drop For All Keywords
  - Does the site rank for different queries than before?
  - Did you substantially change the site content?
  - Did you change the underlying site infrastructure?
  - Was there a large change in linking behavior?
  - Could there be a penalty?
Drop For Only Some Keywords
  - Do different pages rank highest than before?
  - Are the pages that used to rank still indexed?
Ranking Checklist

Relevance
  - What is the page about?
  - Are the pages ranking for the desired query more relevant?
  - Do the pages use the language of the searcher?
Value
  - How many relevant links point to the page, and how authoritative are they?
  - What's the value of the page? (Do more useful pages rank above yours?)
SERP display
  - Are the title and snippet compelling?
  - Do Sitelinks appear for navigational queries?
  - What universal elements appear on the page?
  - Does the site rank for non-branded queries?
The Webmaster Guidelines
http://google.com/support/webmasters/bin/answer.py?answer=35769

Common Definition of Spam
  - On page schemes
      - Keyword stuffing
      - Fake/stolen content
      - Hidden text
      - Hidden links
      - Cloaking
  - Linking schemes
      - Paid links
      - Link exchanges
  - Doorway pages
  - Deceptive redirects
Getting Out of the Penalty Box
  - Check if you've been penalized
      - Live Search: http://webmaster.live.com
      - Google: http://google.com/webmasters (maybe)
  - Review the webmaster guidelines: Google, Live Search, Yahoo
  - Identify the issue
  - Fix it!
  - Request re-evaluation
      - Google: http://google.com/webmasters
      - Live Search: http://webmaster.live.com
Traffic Issues
Traffic Drop
Display Issues
Would you click this link?
Does the Result Inspire Clicks?
First step in diagnosis: find the root
Ninebyblue.com
Twitter.com/vanessafox

Jane and Robot Developer Summit
June 12th, 2009, San Francisco
FREE for SMX attendees!
janeandrobot.com
Twitter.com/janeandrobot

Diagnosing Technical Issues With Search Engine Optimization


Editor's Notes

  • #2: Who in the audience is in SEO, dev, or product? Who knows a lot about SEO, etc.?