scicomm.xyz
robots.txt

Robots Exclusion Standard data for scicomm.xyz

Resource Scan

Scan Details

Site Domain scicomm.xyz
Base Domain scicomm.xyz
Scan Status Ok
Last Scan2024-10-04T05:27:13+00:00
Next Scan 2024-10-05T05:27:13+00:00

Last Scan

Scanned2024-10-04T05:27:13+00:00
URL https://scicomm.xyz/robots.txt
Domain IPs 178.62.64.18, 2a03:b0c0:1:d0::1ec:4001
Response IP 178.62.64.18
Found Yes
Hash 55438dd4431cba3aee8f3e21fa9125bfcc3dff51021c45076a3d7b0470b2f901
SimHash aa74ba85f762

Groups

gptbot

Rule Path
Disallow /

ai2bot

Rule Path
Disallow /

*

Rule Path
Disallow /media_proxy/
Disallow /interact/

Comments

  • See http://www.robotstxt.org/robotstxt.html for documentation on how to use the robots.txt file