im-in.space
robots.txt

Robots Exclusion Standard data for im-in.space

Resource Scan

Scan Details

Site Domain im-in.space
Base Domain im-in.space
Scan Status Ok
Last Scan2024-10-02T16:42:23+00:00
Next Scan 2024-10-03T16:42:23+00:00

Last Scan

Scanned2024-10-02T16:42:23+00:00
URL https://im-in.space/robots.txt
Domain IPs 2a01:4f8:162:70d2::2, 5.9.120.158
Response IP 5.9.120.158
Found Yes
Hash efded5649370003cd90e377385cf4b29bcb839b2b229ec95c87982d6331dac12
SimHash a874ba85f762

Groups

ia_archiver

Rule Path
Disallow /media_proxy/
Disallow

gptbot

Rule Path
Disallow /

*

Rule Path
Disallow /media_proxy/
Disallow /interact/

Comments

  • See http://www.robotstxt.org/robotstxt.html for documentation on how to use the robots.txt file