iso.org
robots.txt

Robots Exclusion Standard data for iso.org

Resource Scan

Scan Details

Site Domain iso.org
Base Domain iso.org
Scan Status Ok
Last Scan2024-10-23T22:59:42+00:00
Next Scan 2024-11-06T22:59:42+00:00

Last Scan

Scanned2024-10-23T22:59:42+00:00
URL https://iso.org/robots.txt
Redirect https://www.iso.org/robots.txt
Redirect Domain www.iso.org
Redirect Base iso.org
Domain IPs 138.81.131.132
Redirect IPs 138.81.131.132
Response IP 138.81.131.132
Found Yes
Hash b9ecdeff6c96006f1ee11055f9f1736697238b36b33072c619d99873083bc33b
SimHash 2e79d8f0e237

Groups

*

Rule Path
Disallow /files/live/sites/isoorg/files/_noindex
Disallow /fr/search/x
Disallow /ru/search/x
Disallow /search/x
Disallow /advanced-search/x/
Disallow /fr/advanced-search/x/
Disallow /ru/advanced-search/x/
Disallow /em
Disallow /webstore/checkout
Disallow /webstore/shoppingbasket
Disallow /webstore/ShoppingBasket
Disallow /home.isoDocumentsDownload.do?

Other Records

Field Value
crawl-delay 5

irlbot

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

omgilibot

Rule Path
Disallow /

omgili

Rule Path
Disallow /

facebookbot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.iso.org/sitemap.xml
sitemap https://www.iso.org/sitemap/standard.xml
sitemap https://www.iso.org/sitemap/standard1.xml
sitemap https://www.iso.org/sitemap/fr/standard.xml
sitemap https://www.iso.org/sitemap/fr/standard1.xml
sitemap https://www.iso.org/sitemap/ru/standard.xml
sitemap https://www.iso.org/sitemap/ru/standard1.xml
sitemap https://www.iso.org/sitemap/committee.xml
sitemap https://www.iso.org/sitemap/fr/committee.xml
sitemap https://www.iso.org/sitemap/ru/committee.xml
sitemap https://www.iso.org/sitemap/publication.xml
sitemap https://www.iso.org/sitemap/fr/publication.xml
sitemap https://www.iso.org/sitemap/ru/publication.xml
sitemap https://www.iso.org/obp/sitemapindex.xml