columbian.com
robots.txt

Robots Exclusion Standard data for columbian.com

Resource Scan

Scan Details

Site Domain columbian.com
Base Domain columbian.com
Scan Status Ok
Last Scan2024-11-09T00:20:39+00:00
Next Scan 2024-11-16T00:20:39+00:00

Last Scan

Scanned2024-11-09T00:20:39+00:00
URL https://columbian.com/robots.txt
Redirect https://www.columbian.com/robots.txt
Redirect Domain www.columbian.com
Redirect Base columbian.com
Domain IPs 172.232.168.173
Redirect IPs 3.165.82.15, 3.165.82.39, 3.165.82.42, 3.165.82.50
Response IP 3.165.82.39
Found Yes
Hash 2672fc5a4e87f5bcb37bdddf8f99aee1f80f4d91e42194599ef878424c7af2d8
SimHash 4a34dc6f0699

Groups

*

Rule Path
Allow facebookexternalhit
Disallow /wp-admin/
Disallow /trackback/
Disallow /xmlrpc.php
Disallow /feed/
Disallow /search/
Disallow /accounts/login/digital/?returnUrl=*

ahrefsbot

Rule Path
Disallow /

baiduspider

Rule Path
Disallow /

ezooms

Rule Path
Disallow /

gnowitnewsbot

Rule Path
Disallow /

heritrix

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

yandexbot

Rule Path
Disallow /

voilabot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 10

orangebot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 10

orangebot-collector

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 10

msnbot-media

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 10

bingbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 10

googlebot-image

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 10

slurp

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 10

archive.org_bot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 10

seekportbot

Rule Path
Disallow /