itcertrocket.com
robots.txt

Robots Exclusion Standard data for itcertrocket.com

Resource Scan

Scan Details

Site Domain itcertrocket.com
Base Domain itcertrocket.com
Scan Status Ok
Last Scan2025-10-12T19:44:35+00:00
Next Scan 2025-10-19T19:44:35+00:00

Last Scan

Scanned2025-10-12T19:44:35+00:00
URL https://itcertrocket.com/robots.txt
Domain IPs 198.54.114.217
Response IP 198.54.114.217
Found Yes
Hash 99d8af781f42fa1c40e89203994fbeb300616b78cdb9374f9d7e78d4ffc28344
SimHash 24475f50a5e2

Groups

*

Rule Path
Allow /robots.txt
Allow /sitemap.xml
Allow /assets/
Allow /images/
Allow /css/
Allow /js/
Disallow /admin/
Disallow /login/
Disallow /duplicate-page/
Disallow /*?sessionid=
Disallow /*?*
Disallow /*?faq=
Disallow /search/
Disallow /checkout/
Disallow /user-account/
Disallow /profile/
Disallow /cart/

googlebot

Rule Path
Disallow /*?faq=
Disallow /*?
Disallow /*?*
Disallow /*?sessionid=
Disallow /admin/
Disallow /login/
Disallow /checkout/
Disallow /user-account/
Disallow /profile/
Disallow /cart/
Allow /faq.php
Allow /sitemap.xml
Allow /robots.txt
Allow /css/
Allow /js/
Allow /images/

bingbot

Rule Path
Disallow /admin/
Disallow /login/
Disallow /duplicate-page/
Disallow /checkout/
Disallow /user-account/
Disallow /profile/
Disallow /cart/
Allow /faq.php

Other Records

Field Value
crawl-delay 10

baidu

Rule Path
Disallow /admin/
Disallow /login/
Disallow /duplicate-page/
Disallow /checkout/
Disallow /user-account/
Disallow /profile/
Disallow /cart/
Allow /sitemap.xml
Allow /faq.php

yandex

Rule Path
Disallow /admin/
Disallow /login/
Disallow /duplicate-page/
Disallow /checkout/
Disallow /user-account/
Disallow /profile/
Disallow /cart/
Allow /sitemap.xml
Allow /faq.php

naver

Rule Path
Disallow /admin/
Disallow /login/
Disallow /duplicate-page/
Disallow /checkout/
Disallow /user-account/
Disallow /profile/
Disallow /cart/
Allow /faq.php
Allow /en/
Allow /fr/
Allow /de/
Allow /es/

facebookexternalhit

Rule Path
Allow /images/
Allow /js/
Allow /css/
Disallow /admin/
Disallow /login/

twitterbot

Rule Path
Allow /images/
Allow /js/
Allow /css/
Disallow /admin/
Disallow /login/

*

Rule Path
Disallow /.PRIVATE/
Disallow /private/
Disallow /backup/
Allow /images/
Allow /styles/
Allow /scripts/
Allow /page/
Disallow /*?sessionid=
Disallow /*%26sessionid%3D
Disallow /*?tracking=
Disallow /*%26tracking%3D

*

Rule Path
Disallow /private/
Disallow /admin/
Disallow /login/
Disallow /checkout/
Disallow /user-account/
Disallow /profile/
Disallow /cart/
Allow /faq.php
Allow /sitemap.xml
Disallow /search/
Disallow /cart/
Disallow /checkout/
Disallow /order/
Disallow /profile/
Disallow /duplicate-page/
Disallow /sessionid/

googlebot

Rule Path
Disallow /*?faq=
Disallow /*?sessionid=
Disallow /*?*
Disallow /admin/
Disallow /login/
Disallow /duplicate-page/
Disallow /checkout/
Disallow /user-account/
Disallow /profile/
Disallow /cart/
Allow /faq.php
Allow /sitemap.xml
Allow /robots.txt
Allow /css/
Allow /js/
Allow /images/

bingbot

Rule Path
Disallow /admin/
Disallow /login/
Disallow /duplicate-page/
Allow /faq.php

Other Records

Field Value
crawl-delay 10

*

Rule Path
Disallow /.PRIVATE/
Allow /faq.php
Allow /sitemap.xml
Allow /robots.txt
Disallow /cgi-bin/
Disallow /tmp/
Disallow /junk/
Disallow /private/
Disallow /tmp/
Disallow /backup/
Disallow /search/
Disallow /cart/
Disallow /checkout/
Disallow /order/
Disallow /login/
Disallow /profile/
Disallow /admin/
Allow /images/
Allow /styles/
Allow /scripts/
Allow /page/
Disallow /*?sessionid=
Disallow /*%26sessionid%3D

*

Rule Path
Disallow /.PRIVATE/
Disallow /private/
Disallow /backup/
Allow /images/
Allow /styles/
Allow /scripts/
Allow /page/
Disallow /*?sessionid=
Disallow /*%26sessionid%3D

Other Records

Field Value Comment
sitemap https://itcertrocket.com/sitemap.xml -
sitemap https://itcertrocket.com/sitemap-en.xml English version sitemap
sitemap https://itcertrocket.com/sitemap-fr.xml French version sitemap
sitemap https://itcertrocket.com/sitemap-de.xml German version sitemap
sitemap https://itcertrocket.com/sitemap-es.xml Spanish version sitemap
sitemap https://itcertrocket.com/sitemap.xml -
sitemap https://itcertrocket.com/sitemap.xml -
sitemap https://itcertrocket.com/sitemap.xml -

Comments

  • General Configuration for All Crawlers (Global SEO)
  • Allow access to essential files like robots.txt and sitemap.xml for proper indexing
  • Allow Crawling of All Image, CSS, JS Resources (important for Google, Bing, and other search engines)
  • Block Unwanted or Duplicate Pages (Preventing duplicate content)
  • Googlebot Specific Configuration
  • Block unnecessary dynamic content (FAQ, sessionid, query params) to avoid duplicate indexing
  • Allow Google to crawl essential pages like FAQ, Sitemap, and other relevant content
  • Allow Googlebot to crawl CSS, JS, and image files for better page rendering and indexing
  • Sitemap Declaration for Googlebot
  • Bingbot Specific Configuration
  • Block Bingbot from sensitive pages like admin or login
  • Allow Bingbot to crawl FAQ page explicitly
  • Define crawl delay to avoid overloading the server
  • Regional and Language-Specific Rules for Global SEO
  • This ensures international search engines like Baidu, Yandex, or regional search engines can crawl localized versions
  • Add hreflang-specific sitemaps for international pages (if applicable)
  • Allowing important URLs related to specific countries and languages
  • Social Media & Crawlers Specific Rules (helps social media bots understand your content)
  • Block All Crawlers from Private Content (Sensitive Data)
  • Allow crawlers to access and index common content like images, CSS, and JavaScript
  • Sitemap for all crawlers
  • Provide some other best practices
  • Allow crawlers to access and crawl paginated content to help with SEO
  • Block any non-content URLs (e.g., tracking, session IDs) from being indexed
  • Block All Crawlers from Sensitive or Hidden Content (Sensitive Data)
  • Allow crawlers to access essential content like FAQ page, Sitemap, etc.
  • Ensure crawlers do not index search, cart, or checkout pages
  • Prevent indexing of duplicate or session-related content
  • Googlebot Configuration (SEO Best Practices)
  • Block search engines from crawling dynamic URL parameters (e.g., tracking or sorting parameters)
  • Block Google from crawling any admin or login pages to prevent indexing of sensitive content
  • Allow Google to crawl essential pages like FAQ, Sitemap, and other relevant content
  • Allow Googlebot to crawl CSS, JS, and image files for better page rendering and indexing
  • Bingbot Configuration
  • Block Bingbot from sensitive pages like admin or login
  • Allow Bingbot to crawl FAQ page explicitly
  • Define crawl delay to avoid overloading the server
  • General Rules for All Crawlers
  • Block any private or hidden content directories
  • Allow crawlers to access and index FAQ page and Sitemap
  • Block specific content from all bots (e.g., unnecessary files or scripts)
  • Block private directories or files from being indexed
  • Additional optimizations for all crawlers
  • Let crawlers access and index common content like images, CSS, and JavaScript
  • Sitemap for all crawlers
  • Provide some other best practices
  • Allow crawlers to access and crawl paginated content to help with SEO
  • Block any non-content URLs (e.g., tracking, session IDs) from being indexed
  • Block All Crawlers from Private Content (Sensitive Data)
  • Allow crawlers to access and index common content like images, CSS, and JavaScript
  • Sitemap for all crawlers
  • Provide some other best practices
  • Allow crawlers to access and crawl paginated content to help with SEO
  • Block any non-content URLs (e.g., tracking, session IDs) from being indexed