internth.com
robots.txt

Robots Exclusion Standard data for internth.com

Resource Scan

Scan Details

Site Domain internth.com
Base Domain internth.com
Scan Status Ok
Last Scan2025-12-16T15:01:34+00:00
Next Scan 2025-12-23T15:01:34+00:00

Last Scan

Scanned2025-12-16T15:01:34+00:00
URL https://internth.com/robots.txt
Domain IPs 104.21.85.48, 172.67.202.76, 2606:4700:3034::ac43:ca4c, 2606:4700:3036::6815:5530
Response IP 104.21.85.48
Found Yes
Hash db9b121f1af683fc63e0a2a410f2f44dd9875894fa2acab6e790bad79e678689
SimHash 3905dc86c192

Groups

*

Rule Path
Disallow /company/
Disallow /user/

Other Records

Field Value
sitemap https://internth.com/sitemap.xml