miladcode.ir
robots.txt

Robots Exclusion Standard data for miladcode.ir

Resource Scan

Scan Details

Site Domain miladcode.ir
Base Domain miladcode.ir
Scan Status Ok
Last Scan2025-06-28T23:23:25+00:00
Next Scan 2025-07-28T23:23:25+00:00

Last Scan

Scanned2025-06-28T23:23:25+00:00
URL https://miladcode.ir/robots.txt
Redirect https://chatgpt.com/robots.txt
Redirect Domain chatgpt.com
Redirect Base chatgpt.com
Domain IPs 104.21.87.17, 172.67.139.50, 2606:4700:3032::6815:5711, 2606:4700:3032::ac43:8b32
Redirect IPs 104.18.32.47, 172.64.155.209, 2606:4700:4400::6812:202f, 2606:4700:4400::ac40:9bd1
Response IP 172.64.155.209
Found Yes
Hash 4a2b2c7cda3be67a6b01994c117d1bafe57b067172504530761da347d9b6740b
SimHash 6424c3b0c6b7

Groups

ccbot

Rule Path
Disallow /

img2dataset

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

anthropic-ai

Rule Path
Disallow /

claude-web

Rule Path
Disallow /

omgilibot

Rule Path
Disallow /

omgili

Rule Path
Disallow /

facebookbot

Rule Path
Disallow /

bytespider

Rule Path
Disallow /

magpie-crawler

Rule Path
Disallow /

*

Rule Path
Allow /$
Allow /?*
Allow /api/share/og/*
Allow /g/*
Allow /s/*
Allow /share/*
Allow /canvas/shared/*
Allow /images/*
Allow /auth/*
Allow /gpts$
Allow /codex$
Allow /search$
Allow /backend-anon/*
Allow /public-api/*
Allow /sitemap.xml
Allow /students
Allow /api/public_content/*
Allow /backend-api/public_content/*
Allow /?ref=dotcom
Disallow /
Disallow /auth/logout
Disallow /auth/login?*
Disallow /backend-anon/sentinel/*
Disallow /backend-anon/conversation$
Disallow /account-link/*

Other Records

Field Value
sitemap https://chatgpt.com/sitemap.xml

Comments

  • https://www.robotstxt.org/robotstxt.html
  • General rules for all other bots
  • Place allows first to avoid bots skipping after Disallow: /
  • Allow exactly the homepage
  • Allow the homepage with any query parameters
  • Now block everything else
  • Specific disallows (redundant for some bots, but still useful for those that respect precedence)