ipsearch.io
robots.txt

Robots Exclusion Standard data for ipsearch.io

Resource Scan

Scan Details

Site Domain ipsearch.io
Base Domain ipsearch.io
Scan Status Ok
Last Scan2025-10-17T15:21:10+00:00
Next Scan 2025-10-24T15:21:10+00:00

Last Scan

Scanned2025-10-17T15:21:10+00:00
URL https://ipsearch.io/robots.txt
Domain IPs 104.21.14.31, 172.67.157.172, 2606:4700:3033::ac43:9dac, 2606:4700:3035::6815:e1f
Response IP 104.21.14.31
Found Yes
Hash 33906e94cfeadf28bf011c052a5118c2a7948f9a4797f2a4c6d703e82b4caefd
SimHash 001fc4c3ee93

Groups

petalbot
aspiegelbot
ahrefsbot
semrushbot
dotbot
mauibot
mj12bot

Rule Path
Disallow /

amazonbot

Rule Path
Disallow /threads/*/reply

*

Rule Path
Disallow /whats-new/
Disallow /account/
Disallow /attachments/
Disallow /goto/
Disallow /posts/
Disallow /login/
Disallow /search/
Disallow /admin.php
Allow /

Other Records

Field Value
sitemap https://ipsearch.io/sitemap.xml