therpc.studio
robots.txt

Robots Exclusion Standard data for therpc.studio

Resource Scan

Scan Details

Site Domain therpc.studio
Base Domain therpc.studio
Scan Status Ok
Last Scan2025-09-25T21:44:07+00:00
Next Scan 2025-10-25T21:44:07+00:00

Last Scan

Scanned2025-09-25T21:44:07+00:00
URL https://therpc.studio/robots.txt
Redirect https://www.therpc.studio/robots.txt
Redirect Domain www.therpc.studio
Redirect Base therpc.studio
Domain IPs 74.208.236.39
Redirect IPs 74.208.236.39
Response IP 74.208.236.39
Found Yes
Hash 0104617ffae296d8b418c4e881ebd4bc255abcd543efe01c71f431ac68f5f702
SimHash 8746dddede47

Groups

*

Rule Path
Disallow /*%26limit
Disallow /*?sort
Disallow /*%26sort
Disallow /*?route=checkout%2F
Disallow /*?route=account%2F
Disallow /*?route=product%2Fsearch
Disallow /*?route=product%2Fsearch&tag
Disallow /*index.php?route=product%2Fsearch&tag
Disallow /*index.php?route=product%2Fsearch&tag=

baiduspider
yisouspider
petalbot
bytespider
sogou web spider
sogou inst spider

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.therpc.studio/index.php?route=information/sitemap

Warnings

  • 1 invalid line.