wpxbox.com
robots.txt

Robots Exclusion Standard data for wpxbox.com

Resource Scan

Scan Details

Site Domain wpxbox.com
Base Domain wpxbox.com
Scan Status Ok
Last Scan2024-11-16T13:45:01+00:00
Next Scan 2024-11-23T13:45:01+00:00

Last Scan

Scanned2024-11-16T13:45:01+00:00
URL https://wpxbox.com/robots.txt
Redirect https://www.wpxbox.com/robots.txt
Redirect Domain www.wpxbox.com
Redirect Base wpxbox.com
Domain IPs 104.21.78.197, 172.67.136.209, 2606:4700:3035::ac43:88d1, 2606:4700:3037::6815:4ec5
Redirect IPs 104.21.78.197, 172.67.136.209, 2606:4700:3035::ac43:88d1, 2606:4700:3037::6815:4ec5
Response IP 104.21.78.197
Found Yes
Hash c2ae710dc37b67baeab9f58362d96625d52e78e3e4c365c9376cdf82bb7ee825
SimHash e85450101133

Groups

*

Rule Path
Allow /

baiduspider

Rule Path
Disallow /

anthropicbot

Rule Path
Disallow /

claude

Rule Path
Disallow /

claudebot

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

facebookbot

Rule Path
Disallow /porpoiseant/*
Disallow /beardeddragon/*
Disallow /tardisrocinante/*
Disallow /ezais/*
Disallow /parsonsmaize/*
Disallow /edmontonalberta/*
Disallow /detroitchicag0/*
Disallow *?gcb=*
Disallow *?cb=*
Disallow *?cmbcb=*
Disallow *?bv=*

Warnings

  • 2 invalid lines.
  • `https` is not a known field.