Site Promotion Information
Search Engine Crawlers and Dynamic Web Pages
Jerry Yu

There are misunderstandings and confusion in the Search Engine Optimization (SEO) world about how search engines index dynamic web pages. It has been claimed that search engine spiders don't index or crawl dynamic web pages well. This statement is only half true. The correct statement should be: "Search engines don't index or crawl dynamic web pages well if the page URL contains the "?" character (without the quotes)." Search engines do index dynamic web pages very well if the page URL contains no "?" characters. URLs that contain "?" are called dynamic URLs.

What web pages are dynamic?

If you know HTML, you know that the web pages you create normally have a .htm or .html file extension. These files are static because the HTML code doesn't change on the fly when the page is requested; the web server sends them as they are, without processing them. They can be viewed without using a web server.

A web page is said to be dynamic if it is created with a server-side scripting language or technology such as PHP, ASP, JSP, Perl, CGI and so on. These languages are like normal programming languages such as C++ or Java. The major difference is that scripts are typically not compiled beforehand; they are processed by the web server on the fly when a visitor requests the page. Dynamic pages can't be viewed without a web server.

When a dynamic page is requested, the web server first looks at the page's source code, and if any server-side scripting code exists, it processes that code and generates static HTML as the result. When processing of the full page is complete, the web server sends only pure HTML to the visitor's browser. Using scripting languages to create web pages gives you the power to do nearly anything you want. If a dynamic page has no "?" character in its URL, search engine spiders treat it the same as a normal static HTML page.

Query string parameters

When a "?" character is used, the page's full URL changes whenever the values after the "?" change.
The portion after the "?" is called the page's query string parameters, or simply query parameters. Every time the parameters change, the resulting page is different. A page URL can contain more than one query parameter, separated by "&" characters. When this happens, search engine spiders have a difficult time indexing the resulting page. If the URL has only one query parameter, major search engine spiders can crawl that page well. For example, Google can index and store a page's URL as http://www.examplesite.com/product.asp?id=12345. But if the same page's URL is http://www.examplesite.com/product.asp?id=12345&category=23&page=3, most search engines will not be able to index it well, even though Googlebot and Yahoo! Slurp may be able to.

Note: Googlebot is Google's web-crawling robot. Yahoo! Slurp is Yahoo's web-crawling robot. Search engine robots collect documents from the web to build a searchable index.

Yahoo help says: "Yahoo! does index dynamic pages, but for page discovery, our crawler mostly follows static links. We recommend you avoid using dynamically generated links except in directories that are not intended to be crawled/indexed (e.g., those should have a /robots.txt exclusion)."

Google's Webmaster Guidelines: "If you decide to use dynamic pages (i.e., the URL contains a "?" character), be aware that not every search engine spider crawls dynamic pages as well as static pages. It helps to keep the parameters short and the number of them small."

Let's analyze what Google has stated above.

1. "the URL contains a "?" character": this means dynamic pages are defined as those whose URLs contain a "?" character.

2. "keep the parameters short": this means the number of characters in each individual parameter should be small. Google gives no quantitative measure, but we can check some web forums for examples. My search engine friendly article (http://www.webactionguide/action-guide/build-site/se-friendly.php) referenced a black hat SEO discussion thread on Cre8ASiteForums.
Its URL is http://www.cre8asiteforums.com/viewtopic.php?t=8386. This page was crawled by Google. The value of its query parameter is 4 characters long. There are many other examples on the internet that have more characters and were crawled successfully. The maximum number of characters that Google accepts is unknown.

3. "keep the number of them small": this means we should keep the number of parameters in each URL as small as possible. The Cre8ASiteForums example above has one parameter. At least we can now say that Googlebot is able to crawl dynamic pages that have one query parameter whose value is 4 characters long.

How to get your pages crawled if query parameters are unavoidable

Query parameters are often used for database calls that retrieve stored information using primary keys in one or more tables. A Database Management System (DBMS) makes some tedious work easy to manage. When query parameters must be used on your site, consider building a site map page and hard-coding each page's URL. For example, the previous URL can be hard-coded as http://www.examplesite.com/product12345-23-3.asp

Hand-coding every dynamic page is time-consuming. If you use the Apache web server, the Apache mod_rewrite module (http://httpd.apache.org/docs/mod/mod_rewrite.html) can help: it lets you publish URLs with no "?" character embedded and rewrites each requested URL on the fly to the one your script expects. Another mod_rewrite resource site is www.modrewrite.com.

An interesting article on weberblog.com describes a practical example of how Google successfully indexed a dynamic page after mod_rewrite was applied. The page originally had 17 characters in the query parameter.

Before rewrite: http://www.weberblog.com/article.php?sroty=20040419170030157
After rewrite: http://www.weberblog.com/article.php/20040419170030157

So, if your site is experiencing the same problem, hurry up and implement mod_rewrite now.
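To make the idea concrete, here is a minimal Python sketch of the two things discussed above: counting how many query parameters a URL carries, and mapping a static-looking URL back to the dynamic one a script expects. It is an illustration only, not Apache configuration and not how mod_rewrite is implemented. The function names and the URL pattern are hypothetical, and the parameter names id, category and page are simply taken from the examplesite.com example.

```python
import re
from urllib.parse import parse_qsl, urlsplit

# Hypothetical pattern for the article's hard-coded example:
# /product<id>-<category>-<page>.asp
STATIC_PATTERN = re.compile(r"^/product(\d+)-(\d+)-(\d+)\.asp$")


def count_query_parameters(url: str) -> int:
    """Return how many query parameters follow the "?" in a URL."""
    query = urlsplit(url).query
    return len(parse_qsl(query, keep_blank_values=True))


def to_dynamic_path(static_path: str) -> str:
    """Map a static-looking path to the dynamic path the script expects.

    This mimics, in spirit only, what a server-side rewrite rule does.
    """
    match = STATIC_PATTERN.match(static_path)
    if match is None:
        # Not one of the rewritten product URLs; leave it unchanged.
        return static_path
    product_id, category, page = match.groups()
    return f"/product.asp?id={product_id}&category={category}&page={page}"


if __name__ == "__main__":
    dynamic_url = "http://www.examplesite.com/product.asp?id=12345&category=23&page=3"
    print(count_query_parameters(dynamic_url))        # 3
    print(to_dynamic_path("/product12345-23-3.asp"))  # /product.asp?id=12345&category=23&page=3
```

Visitors and spiders only ever see the first form of the URL, while the script still receives its parameters; a real site would express the same mapping as a rewrite rule in the web server rather than in application code.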
© 2005