
Robots Exclusion Standard data for domainscorporation.com

Resource Scan

Scan Details

Site Domain domainscorporation.com
Base Domain domainscorporation.com
Scan Status Ok
Last Scan 2025-03-07T21:30:13+00:00
Next Scan 2025-04-06T21:30:13+00:00

Last Scan

Scanned 2025-03-07T21:30:13+00:00
URL https://domainscorporation.com/robots.txt
Domain IPs 62.182.20.60
Response IP 62.182.20.60
Found Yes
Hash 4e196337f1d05e40a16780a1df6a28516f29592b1b4ec1b0065ed4dfad00a076
SimHash 437c56b0d89b

Groups

bingbot

Rule Path
Disallow /

msnbot

Rule Path
Disallow /

yandex

Rule Path
Disallow /

slurp

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

aihitbot

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

seznambot

Rule Path
Disallow /

wotbox

Rule Path
Disallow /

wada.vn vietnamese search

Rule Path
Disallow /

ips-agent

Rule Path
Disallow /

ezooms

Rule Path
Disallow /

sosospider

Rule Path
Disallow /

baiduspider

Rule Path
Disallow /

exabot

Rule Path
Disallow /

jikespider

Rule Path
Disallow /

obot

Rule Path
Disallow /

sogou

Rule Path
Disallow /

sogou web spider

Rule Path
Disallow /

sistrix

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

mlbot

Rule Path
Disallow /

voilabot

Rule Path
Disallow /

ia_archiver

Rule Path
Disallow /

uptimebot

Rule Path
Disallow /

ioncrawl

Rule Path
Disallow /

mail.ru_bot

Rule Path
Disallow /

mail.ru

Rule Path
Disallow /

firmograph

Rule Path
Disallow /

*

Rule Path
Disallow (empty)

Other Records

Field Value
crawl-delay 36000
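
How these records are likely to be interpreted by a crawler can be sketched with Python's standard `urllib.robotparser`. The snippet below is only an illustration: the `ROBOTS_TXT` string is a hypothetical reconstruction of three of the groups listed above, not the actual file (its ordering, comments, and the invalid lines are not known from this report), and placing `Crawl-delay` inside the `*` group is an assumption. `ExampleBot` and the `allowed` helper are made-up names.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical reconstruction of a few groups from the report.
# Assumption: crawl-delay belongs to the "*" group; the report does not say.
ROBOTS_TXT = """\
User-agent: bingbot
Disallow: /

User-agent: ahrefsbot
Disallow: /

User-agent: *
Disallow:
Crawl-delay: 36000
"""

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())


def allowed(agent: str, url: str = "https://domainscorporation.com/") -> bool:
    """Return True if the parsed rules let `agent` fetch `url`."""
    return parser.can_fetch(agent, url)


print(allowed("bingbot"))                # False: "Disallow: /" blocks everything
print(allowed("ExampleBot"))             # True: an empty Disallow blocks nothing
print(parser.crawl_delay("ExampleBot"))  # 36000 (seconds, i.e. 10 hours)
```

Under these assumptions, the named crawlers are shut out entirely, while any other agent falls through to the `*` group, where the empty `Disallow` permits all paths and only the very long crawl delay applies. Crawlers differ in whether they honor `crawl-delay` at all, so the parsed value is a request rather than a guarantee.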

Warnings

  • 4 invalid lines.