help.instagram.com
robots.txt

Robots Exclusion Standard data for help.instagram.com

Resource Scan

Scan Details

Site Domain help.instagram.com
Base Domain instagram.com
Scan Status Ok
Last Scan2025-09-17T01:16:00+00:00
Next Scan 2025-10-01T01:16:00+00:00

Last Scan

Scanned2025-09-17T01:16:00+00:00
URL https://help.instagram.com/robots.txt
Domain IPs 2a03:2880:f34c:22:face:b00c:0:4420, 57.144.160.34
Response IP 157.240.15.174
Found Yes
Hash f8b23e98bfac51e3e02e9f49291f414a6660f00641ce1583de3a32ebd8b1271f
SimHash b8011d4d4d15

Groups

facebookexternalhit

Rule Path
Allow *

*

Rule Path
Disallow /*cursor%3D
Disallow /*fb_comment_id%3D
Disallow /ajax/
Disallow /tealium/
Disallow /intern/
Disallow /internal/
Disallow /login/
Disallow /oidc/callback/
Disallow /*.php

Other Records

Field Value
sitemap https://help.instagram.com/sitemap/help_instagram_com_sitemap.xml.gz

Comments

  • Notice: Collection of data on Facebook through automated means is
  • prohibited unless you have express written permission from Facebook
  • and may only be conducted for the limited purpose contained in said
  • permission.
  • See: http://www.facebook.com/apps/site_scraping_tos_terms.php