greenbook.org
robots.txt

Robots Exclusion Standard data for greenbook.org

Resource Scan

Scan Details

Site Domain greenbook.org
Base Domain greenbook.org
Scan Status Ok
Last Scan 2024-09-14T21:03:28+00:00
Next Scan 2024-10-14T21:03:28+00:00

Last Scan

Scanned 2024-09-14T21:03:28+00:00
URL https://greenbook.org/robots.txt
Redirect https://www.greenbook.org/robots.txt
Redirect Domain www.greenbook.org
Redirect Base greenbook.org
Domain IPs 76.76.21.21
Redirect IPs 104.26.10.202, 104.26.11.202, 172.67.72.125, 2606:4700:20::681a:aca, 2606:4700:20::681a:bca, 2606:4700:20::ac43:487d
Response IP 172.67.72.125
Found Yes
Hash c09e2e1e68cb0c13d4b93e77b86b0a76b8f411442e7a03256aeb3d7bc93d19f0
SimHash e35847622832
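
The Hash value above is 64 hexadecimal characters, which is consistent with a SHA-256 digest of the fetched robots.txt body. The report does not name the algorithm, so treat that as an assumption. Under that assumption, a minimal sketch of how such a fingerprint could be computed, and how two scans could be compared for changes, might look like this (the function name `content_fingerprint` is hypothetical):

```python
import hashlib


def content_fingerprint(body: bytes) -> str:
    """Return a hex digest of the raw robots.txt bytes.

    Assumption: SHA-256, chosen only because the report's Hash
    field is 64 hex characters long. The scanner may differ.
    """
    return hashlib.sha256(body).hexdigest()


def has_changed(previous_digest: str, body: bytes) -> bool:
    """Compare a stored digest against a freshly fetched body."""
    return content_fingerprint(body) != previous_digest
```

An exact hash like this only signals that something changed. The separate SimHash field presumably serves to estimate how similar two versions are, which this sketch does not attempt.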

Groups

*

Rule Path
Disallow /directory-manager/

*

Rule Path
Disallow /listing-app/

*

Rule Path
Disallow /company/tag/GetCompanyTags

*

Rule Path
Disallow /company/tag/GetCompanyRating

*

Rule Path
Disallow /company/tag/GetCompanyNotes

*

Rule Path
Disallow /DirectoryHome/

*

Rule Path
Disallow /greenbook/

*

Rule Path
Disallow /ads/

*

Rule Path
Disallow /DocumentPlus/

*

Rule Path
Disallow /IIC/

*

Rule Path
Disallow /iiex/

*

Rule Path
Disallow /IIeX2014/

*

Rule Path
Disallow /iiwebinars/

*

Rule Path
Disallow /img/

*

Rule Path
Disallow /company/ThinkNow-Research

*

Rule Path
Disallow /PreviousNextCompany/

*

Rule Path
Disallow /print

*

Rule Path
Disallow /login

*

Rule Path
Disallow /Register

*

Rule Path
Disallow /PressReleases/

*

Rule Path
Disallow /newsletter/editions/

*

Rule Path
Disallow /Company?shortName=*

*

Rule Path
Disallow /*.cfm$

*

Rule Path
Disallow /company/*/print

*

Rule Path
Disallow /company/*/email

*

Rule Path
Disallow /market-research-firms/wp-login.php

*

Rule Path
Disallow /market-research-firms/*?*

*

Rule Path
Disallow /keyword-search-results/*?*

*

Rule Path
Disallow /product/GetRelatedProductNextItems

*

Rule Path
Disallow /mr/other/

*

Rule Path
Disallow /listing-app

*

Rule Path
Disallow /company/preview-*

*

Rule Path
Disallow /*?preview=

*

Rule Path
Disallow /*?q=

*

Rule Path
Disallow /account*

*

Rule Path
Disallow /package-company*

*

Rule Path
Disallow /*?filter=

megaindex.com

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

mojeekbot

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

yandexbot

Rule Path
Disallow /

sogou web spider

Rule Path
Disallow /

siteauditbot-desktop

No rules defined. All paths allowed.
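
Several of the paths listed above use the `*` wildcard and the `$` end anchor (for example `/*.cfm$` and `/company/*/print`). Python's standard `urllib.robotparser` does not interpret these, so the following is a minimal sketch of wildcard-aware matching, assuming the common convention that `*` matches any run of characters, a trailing `$` anchors the end of the URL, and every other rule is a prefix match. The helper names are hypothetical, and the sketch covers Disallow rules only, with no Allow precedence.

```python
import re


def rule_to_regex(path_pattern: str) -> re.Pattern:
    """Translate one robots.txt path pattern into a compiled regex."""
    anchored = path_pattern.endswith("$")
    if anchored:
        path_pattern = path_pattern[:-1]
    # Escape literal segments; each '*' becomes '.*'.
    regex = ".*".join(re.escape(part) for part in path_pattern.split("*"))
    if anchored:
        regex += "$"
    return re.compile(regex)


def is_disallowed(url_path: str, disallow_rules: list[str]) -> bool:
    """True if any Disallow pattern matches from the start of url_path."""
    return any(rule_to_regex(rule).match(url_path) for rule in disallow_rules)


# A few of the rules from the `*` groups in this report.
RULES = ["/company/*/print", "/*.cfm$", "/*?q=", "/login"]

if __name__ == "__main__":
    for path in ["/company/acme/print", "/old/page.cfm", "/about"]:
        print(path, is_disallowed(path, RULES))
```

Because unanchored rules are prefix matches, `/login` in this sketch also blocks a path such as `/login-help`.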

Other Records

Field Value
crawl-delay 3

Other Records

Field Value
sitemap https://www.greenbook.org/sitemap.xml
sitemap https://www.greenbook.org/sitemaps/companies.xml
sitemap https://www.greenbook.org/sitemaps/case-studies.xml
sitemap https://www.greenbook.org/sitemaps/library-items.xml

Comments

  • *
  • *
  • *
  • *
  • *
  • *
  • *
  • *
  • *
  • *
  • *
  • *
  • *
  • *
  • *
  • *
  • *
  • *
  • *
  • *
  • *
  • *
  • *
  • *
  • *
  • *
  • *
  • *
  • *
  • *
  • *
  • *
  • *
  • *
  • *
  • *
  • *
  • megaindex.com
  • DotBot
  • MJ12bot
  • CCBot
  • MojeekBot
  • AhrefsBot
  • YandexBot
  • Sogou web spider
  • SiteAuditBot-Desktop
  • Host
  • Sitemaps

Warnings

  • `host` is not a known field.
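
A warning like the one above can arise when a parser checks each `field: value` line against a list of recognized field names. The sketch below illustrates that idea. The set of known fields and the function name `scan_records` are assumptions, not the scanner's actual implementation, and the sample text is hypothetical: it reuses the crawl-delay and sitemap values from this report, while the `Host` value is invented for illustration.

```python
# Assumed set of recognized fields; the real scanner's list is not given.
KNOWN_FIELDS = {"user-agent", "allow", "disallow", "sitemap", "crawl-delay"}


def scan_records(robots_txt: str):
    """Return (records, warnings) for a robots.txt body.

    records  : list of (lowercased field, value) tuples in file order
    warnings : one message per distinct unrecognized field
    """
    records, warnings = [], []
    for raw in robots_txt.splitlines():
        line = raw.split("#", 1)[0].strip()  # drop comments
        if not line or ":" not in line:
            continue
        field, value = line.split(":", 1)  # split once so URLs survive
        field = field.strip().lower()
        records.append((field, value.strip()))
        if field not in KNOWN_FIELDS:
            message = f"`{field}` is not a known field."
            if message not in warnings:
                warnings.append(message)
    return records, warnings


# Hypothetical sample; the Host value is an illustrative assumption.
SAMPLE = """\
# Host
Host: www.greenbook.org
Crawl-delay: 3
# Sitemaps
Sitemap: https://www.greenbook.org/sitemap.xml
"""

if __name__ == "__main__":
    records, warnings = scan_records(SAMPLE)
    print(records)
    print(warnings)
```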