internic.com
robots.txt

Robots Exclusion Standard data for internic.com

Resource Scan

Scan Details

Site Domain internic.com
Base Domain internic.com
Scan Status Ok
Last Scan 2026-01-13T18:28:16+00:00
Next Scan 2026-02-12T18:28:16+00:00

Last Scan

Scanned 2026-01-13T18:28:16+00:00
URL http://www.internic.com/robots.txt
Domain IPs 192.0.46.9, 2620:0:2830:200::b:9
Response IP 192.0.46.9
Found Yes
Hash 863958b103a224a185933b58afeafa08be7812ae959ed11ebccf5519c764a3a9
SimHash ac266012cfdf

Groups

ia_archiver

Rule Path
Disallow

*

Rule Path
Disallow /problem_reports/

Other Records

Field Value
sitemap https://www.internic.net/sitemap.xml
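The groups and records above can be checked with Python's standard urllib.robotparser. The following is a minimal sketch, not the scanner's own method: the ROBOTS_TXT string is a reconstruction from the parsed groups in this report (the raw file is not reproduced here, so its exact layout is an assumption), the allowed() helper and the "ExampleBot" agent name are hypothetical, and site_maps() requires Python 3.8 or later.

```python
from urllib.robotparser import RobotFileParser

# ASSUMPTION: reconstructed from the parsed groups and records in this report;
# the original file's comments, ordering, and whitespace may differ.
ROBOTS_TXT = """\
User-agent: ia_archiver
Disallow:

User-agent: *
Disallow: /problem_reports/

Sitemap: https://www.internic.net/sitemap.xml
"""

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())


def allowed(agent: str, path: str) -> bool:
    """Hypothetical helper: may `agent` fetch `path` on www.internic.com?"""
    return parser.can_fetch(agent, "http://www.internic.com" + path)


if __name__ == "__main__":
    # An empty Disallow value places no restriction on ia_archiver.
    print(allowed("ia_archiver", "/problem_reports/form.html"))
    # Every other agent falls through to the "*" group.
    print(allowed("ExampleBot", "/problem_reports/form.html"))
    print(allowed("ExampleBot", "/whois.html"))
    print(parser.site_maps())
```

An empty Disallow path matches nothing, so the ia_archiver group permits everything, while all other agents are kept out of /problem_reports/ only.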

Comments

  • Hi, welcome to the Internic robots.txt
  • Permit the Internet Archive to display anything (avoiding retroactive respect.)
  • Then by default just deny pages that redirect to ICANN (mainly problems)
  • Disallow: /403/
  • Finally a sitemap
  • Thanks, ptudor@icann