checkbook.org
robots.txt

Robots Exclusion Standard data for checkbook.org

Resource Scan

Scan Details

Site Domain checkbook.org
Base Domain checkbook.org
Scan Status Ok
Last Scan2024-09-25T16:46:05+00:00
Next Scan 2024-10-25T16:46:05+00:00

Last Scan

Scanned2024-09-25T16:46:05+00:00
URL https://checkbook.org/robots.txt
Redirect https://www.checkbook.org/robots.txt
Redirect Domain www.checkbook.org
Redirect Base checkbook.org
Domain IPs 104.20.62.177
Redirect IPs 104.20.62.177, 104.20.63.177
Response IP 104.20.63.177
Found Yes
Hash 7d6df0c7793659827b0f123800fdab5ab253846ee962cbfc5732109ef4871b18
SimHash 0b1cd48467e0

Groups

*

Product Comment
* All robots
Rule Path
Disallow /cgi-bin/memberonly/
Disallow /interactive/
Disallow /newhig2/docs/
Disallow /interactive/
Disallow /search/autocomplete/

yandex

Rule Path Comment
Disallow / blocks access to whole site

megaindex.ru

Rule Path
Disallow /

megaindex.com

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.checkbook.org/sitemap_index.xml

Comments

  • Blocking Folders
  • Blocking Robots
  • https://megaindex.com/crawler
  • Sitemap file