help.instagram.com
robots.txt
Robots Exclusion Standard data for help.instagram.com
Resource Scan
Scan Details
Site Domain | help.instagram.com |
Base Domain | instagram.com |
Scan Status | Ok |
Last Scan | 2025-09-17T01:16:00+00:00 |
Next Scan | 2025-10-01T01:16:00+00:00 |
Last Scan
Scanned | 2025-09-17T01:16:00+00:00 |
URL | https://help.instagram.com/robots.txt |
Domain IPs | 2a03:2880:f34c:22:face:b00c:0:4420, 57.144.160.34 |
Response IP | 157.240.15.174 |
Found | Yes |
Hash | f8b23e98bfac51e3e02e9f49291f414a6660f00641ce1583de3a32ebd8b1271f |
SimHash | b8011d4d4d15 |
Groups
*
Rule | Path |
---|---|
Disallow | /*cursor%3D |
Disallow | /*fb_comment_id%3D |
Disallow | /ajax/ |
Disallow | /tealium/ |
Disallow | /intern/ |
Disallow | /internal/ |
Disallow | /login/ |
Disallow | /oidc/callback/ |
Disallow | /*.php |
Other Records
Field | Value |
---|---|
sitemap | https://help.instagram.com/sitemap/help_instagram_com_sitemap.xml.gz |
Comments