intairnet.org
robots.txt

Robots Exclusion Standard data for intairnet.org

Resource Scan

Scan Details

Site Domain intairnet.org
Base Domain intairnet.org
Scan Status Failed
Failure Stage Fetching resource.
Failure Reason Server redirected incorrectly.
Last Scan 2025-12-22T06:55:05+00:00
Next Scan 2026-03-22T06:55:05+00:00

Last Successful Scan

Scanned 2025-05-03T13:38:57+00:00
URL https://intairnet.org/robots.txt
Redirect https://www.intairnet.org/robots.txt
Redirect Domain www.intairnet.org
Redirect Base intairnet.org
Domain IPs 104.21.48.5, 172.67.175.39, 2606:4700:3033::6815:3005, 2606:4700:3037::ac43:af27
Redirect IPs 104.21.48.5, 172.67.175.39, 2606:4700:3033::6815:3005, 2606:4700:3037::ac43:af27
Response IP 172.67.175.39
Found Yes
Hash 7795f69d15498b333f9eb481c695206837927c5508dc63c1d839587ea59223b3
SimHash 483142c0e2f3

Groups

ia_archiver

Rule Path
Disallow /

surveybot

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

rogerbot

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

gigabot

Rule Path
Disallow /

exabot

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

majestic-12

Rule Path
Disallow /

scoutjet

Rule Path
Disallow /
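Each of the ten named groups above blocks a single crawler from the entire site. As a rough illustration of how a crawler might decide which group applies to it, here is a minimal sketch. It assumes RFC 9309-style selection (case-insensitive match on the crawler's product token, falling back to the `*` group) and approximates the token match with a substring test. The function name `select_group` and the set `BLOCKED_EVERYWHERE` are hypothetical and do not come from the scan.

```python
# Hypothetical sketch: pick the robots.txt group that applies to a crawler.
# Group names are copied from the scan; the matching logic is an assumption
# modelled on RFC 9309 (case-insensitive token match, "*" as fallback).

BLOCKED_EVERYWHERE = {
    "ia_archiver", "surveybot", "ahrefsbot", "rogerbot", "dotbot",
    "gigabot", "exabot", "mj12bot", "majestic-12", "scoutjet",
}


def select_group(user_agent: str) -> str:
    """Return the name of the group that would apply to user_agent."""
    ua = user_agent.lower()
    # Substring test is a simplification of product-token matching.
    matches = [name for name in BLOCKED_EVERYWHERE if name in ua]
    if matches:
        # Prefer the most specific (longest) matching name.
        return max(matches, key=len)
    return "*"


if __name__ == "__main__":
    for ua in ("Mozilla/5.0 (compatible; AhrefsBot/7.0)", "ExampleCrawler/1.0"):
        print(ua, "->", select_group(ua))
```

Any crawler that resolves to one of the named groups is subject to `Disallow /`; everything else is governed by the `*` group.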

*

Rule Path
Disallow /cgi-bin/
Disallow /wp-admin/
Disallow /wp-includes/
Disallow /wp-content/plugins/
Disallow /wp-content/cache/
Disallow /wp-content/themes/
Disallow /trackback/
Disallow */category/
Disallow /feed/
Disallow /comments/
Disallow */trackback/
Disallow /trackback*/
Disallow */feed/
Disallow /feed*/
Disallow */comments/
Disallow /comments*/
Disallow /*?*
Disallow /*?
Disallow /wp-login.php
Disallow /webhost/
Disallow /page/
Allow /wp-content/uploads/
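Several rules in the `*` group use the `*` wildcard, and one `Allow` rule sits alongside the `Disallow` rules. The following is a minimal sketch of how such rules might be evaluated. It assumes RFC 9309 semantics (the pattern is matched from the start of the path, `*` matches any run of characters, a trailing `$` anchors the end, the longest matching pattern wins, and `Allow` wins ties). The names `RULES`, `_to_regex`, and `is_allowed` are hypothetical; a real crawler may behave differently.

```python
import re

# Rules copied from the "*" group in the scan, in their listed order.
RULES = [
    ("disallow", "/cgi-bin/"), ("disallow", "/wp-admin/"),
    ("disallow", "/wp-includes/"), ("disallow", "/wp-content/plugins/"),
    ("disallow", "/wp-content/cache/"), ("disallow", "/wp-content/themes/"),
    ("disallow", "/trackback/"), ("disallow", "*/category/"),
    ("disallow", "/feed/"), ("disallow", "/comments/"),
    ("disallow", "*/trackback/"), ("disallow", "/trackback*/"),
    ("disallow", "*/feed/"), ("disallow", "/feed*/"),
    ("disallow", "*/comments/"), ("disallow", "/comments*/"),
    ("disallow", "/*?*"), ("disallow", "/*?"),
    ("disallow", "/wp-login.php"), ("disallow", "/webhost/"),
    ("disallow", "/page/"), ("allow", "/wp-content/uploads/"),
]


def _to_regex(pattern: str) -> re.Pattern:
    """Translate a robots.txt path pattern into a compiled regex."""
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape literal pieces and join them with ".*" for each "*" wildcard.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(path: str) -> bool:
    """Return True if path may be fetched under the assumed semantics."""
    best_len = -1
    allowed = True  # No matching rule means the path is allowed.
    for kind, pattern in RULES:
        if _to_regex(pattern).match(path):
            length = len(pattern)
            # Longest pattern wins; on equal length, Allow takes precedence.
            if length > best_len or (length == best_len and kind == "allow"):
                best_len = length
                allowed = kind == "allow"
    return allowed


if __name__ == "__main__":
    for p in ("/about/", "/wp-admin/options.php", "/?s=test",
              "/blog/category/news/", "/wp-content/uploads/2024/photo.jpg"):
        print(p, "->", "allowed" if is_allowed(p) else "disallowed")
```

Under these assumptions, any URL containing a query string is blocked by the `/*?*` rule unless a longer `Allow` pattern also matches it.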