irdc.ir
robots.txt

Robots Exclusion Standard data for irdc.ir

Resource Scan

Scan Details

Site Domain irdc.ir
Base Domain irdc.ir
Scan Status Ok
Last Scan2026-02-03T00:57:31+00:00
Next Scan 2026-03-05T00:57:31+00:00

Last Scan

Scanned2026-02-03T00:57:31+00:00
URL https://irdc.ir/robots.txt
Domain IPs 194.41.48.13
Response IP 194.41.48.13
Found Yes
Hash f661fe56a6d41734e8d7ab9579e0287df6814c246552dfc0028eba0253d76cd4
SimHash 2cd41d65cbc4

Groups

*

Rule Path
Disallow /files/adv/*
Disallow */*.swf
Disallow /admin
Disallow /fa/ads/*
Disallow /fa/ajax/*
Disallow /en/ads/*
Disallow /en/ajax/*
Disallow /ar/ads/*
Disallow /ar/ajax/*
Disallow /fa/report/*
Disallow /ar/report/*
Disallow /en/report/*
Disallow /fa/send/*
Disallow /ar/send/*
Disallow /en/send/*
Disallow /ar/print/*
Disallow /en/print/*
Disallow /ar/news/play/*
Disallow /en/news/play/*
Disallow /fa/tags/*
Disallow /ar/tags/*
Disallow /en/tags/*
Disallow /fa/save/*
Disallow /ar/save/*
Disallow /en/save/*
Disallow /fa/services/section/*
Disallow /en/services/section/*
Disallow /ar/services/section/*

Other Records

Field Value
sitemap https://irdc.ir/sitemap.xml
sitemap https://irdc.ir/fa-sitemap-newsarchive
sitemap https://irdc.ir/fa-sitemap-news
sitemap https://irdc.ir/ar-sitemap-newsarchive
sitemap https://irdc.ir/ar-sitemap-news
sitemap https://irdc.ir/en-sitemap-newsarchive
sitemap https://irdc.ir/en-sitemap-news