uslawexplained.com
robots.txt

Robots Exclusion Standard data for uslawexplained.com

Resource Scan

Scan Details

Site Domain uslawexplained.com
Base Domain uslawexplained.com
Scan Status Ok
Last Scan2026-03-02T06:01:15+00:00
Next Scan 2026-03-09T06:01:15+00:00

Last Scan

Scanned2026-03-02T06:01:15+00:00
URL https://uslawexplained.com/robots.txt
Domain IPs 104.21.4.38, 172.67.131.159, 2606:4700:3036::6815:426, 2606:4700:3036::ac43:839f
Response IP 172.67.131.159
Found Yes
Hash 6ba973138938507116fd1c62572010f46ca14a4e48012a342fb063594bcd8a6c
SimHash 1c65d0208532

Groups

gptbot

Rule Path
Disallow /

blexbot

Rule Path
Disallow /

barkrowler

Rule Path
Disallow /

proximic

Rule Path
Disallow /

ias-ie

Rule Path
Disallow /
Disallow /*?do=
Disallow /lib/exe/

Comments

  • 禁止爬虫访问所有 DokuWiki 的功能性、计算密集型页面