nakedcapitalism.com
robots.txt

Robots Exclusion Standard data for nakedcapitalism.com

Resource Scan

Scan Details

Site Domain nakedcapitalism.com
Base Domain nakedcapitalism.com
Scan Status Ok
Last Scan 2024-05-26T20:46:08+00:00
Next Scan 2024-06-02T20:46:08+00:00

Last Scan

Scanned 2024-05-26T20:46:08+00:00
URL https://nakedcapitalism.com/robots.txt
Domain IPs 104.26.12.125, 104.26.13.125, 172.67.72.161, 2606:4700:20::681a:c7d, 2606:4700:20::681a:d7d, 2606:4700:20::ac43:48a1
Response IP 104.26.13.125
Found Yes
Hash efae2b430fb71f10e45fdd01248eba24867a26e89068b60c91fdbff6f58bd69a
SimHash 4c7def42c2ae

Groups

yandex

Rule Path
Disallow /

baiduspider

Rule Path
Disallow /

baiduspider+(+http://www.baidu.com/search/spider.htm)

Rule Path
Disallow /

*

Rule Path
Allow /
Disallow /trackback/
Disallow /cgi-bin/
Disallow /wp-admin/
Disallow /wp-login.php
Disallow /author/bl*
Disallow */feed
Disallow */feed/
Disallow */trackback/
Disallow /*?*
Disallow /*?
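
The rules in the * group use the * wildcard, which Python's standard urllib.robotparser does not interpret. The following is a minimal sketch of how a crawler might evaluate these rules, assuming the common convention (described in RFC 9309) that * matches any character sequence, a trailing $ anchors the end of the path, the longest matching pattern wins, and Allow wins a tie. The names RULES, pattern_to_regex, and is_allowed are hypothetical, chosen for illustration; this is not how any particular crawler is known to process this site.

```python
import re

# Rules of the "*" group, copied from the listing above.
RULES = [
    ("allow", "/"),
    ("disallow", "/trackback/"),
    ("disallow", "/cgi-bin/"),
    ("disallow", "/wp-admin/"),
    ("disallow", "/wp-login.php"),
    ("disallow", "/author/bl*"),
    ("disallow", "*/feed"),
    ("disallow", "*/feed/"),
    ("disallow", "*/trackback/"),
    ("disallow", "/*?*"),
    ("disallow", "/*?"),
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a regex matched from the path start."""
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # "*" becomes ".*"; every other character is matched literally.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(path, rules=RULES):
    """Return True if `path` may be fetched under the assumed matching convention."""
    best_len = -1
    allowed = True  # no matching rule means the path is allowed
    for kind, pattern in rules:
        if pattern_to_regex(pattern).match(path):
            length = len(pattern)
            # Longest pattern wins; on equal length, Allow takes precedence.
            if length > best_len or (length == best_len and kind == "allow"):
                best_len = length
                allowed = kind == "allow"
    return allowed


if __name__ == "__main__":
    for p in ["/", "/feed/", "/?s=query", "/author/blogger", "/wp-admin/index.php"]:
        print(p, is_allowed(p))
```

Under these assumptions, any path containing ? is blocked by /*?* and /*?, which agrees with the comment recorded in the file.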

Other Records

Field Value
crawl-delay 6
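
The crawl-delay field is not part of RFC 9309, and crawlers that honor it typically read the value as a number of seconds between requests. A minimal sketch of a client-side throttle built on that assumption follows. The class name PoliteFetcher and its parameters are hypothetical, and the clock and sleep functions are injectable only so the behavior can be checked without actually waiting.

```python
import time

CRAWL_DELAY = 6  # value from the record above; the unit (seconds) is an assumption


class PoliteFetcher:
    """Enforces a minimum gap between successive requests to one host."""

    def __init__(self, delay=CRAWL_DELAY, clock=time.monotonic, sleep=time.sleep):
        self.delay = delay
        self.clock = clock
        self.sleep = sleep
        self.last = None  # time of the previous request, if any

    def wait_turn(self):
        """Block until `delay` seconds have passed since the last call.

        Returns the number of seconds actually slept.
        """
        waited = 0.0
        if self.last is not None:
            remaining = self.delay - (self.clock() - self.last)
            if remaining > 0:
                self.sleep(remaining)
                waited = remaining
        self.last = self.clock()
        return waited
```

A crawler would call wait_turn() immediately before each request to the site, then issue the request with whatever HTTP client it uses.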

Comments

  • Disallow all files with ? in url