directindustry.it
robots.txt

Robots Exclusion Standard data for directindustry.it

Resource Scan

Scan Details

Site Domain directindustry.it
Base Domain directindustry.it
Scan Status Ok
Last Scan2026-02-14T05:22:19+00:00
Next Scan 2026-03-16T05:22:19+00:00

Last Scan

Scanned2026-02-14T05:22:19+00:00
URL https://www.directindustry.it/robots.txt
Domain IPs 104.18.18.206, 104.18.19.206
Response IP 104.18.18.206
Found Yes
Hash 126782cd36a5668d7bc702fa84eb8ad6e5f424d82f9cdcd064f7e19da6d7fe74
SimHash e902bd02efbb

Groups

ocelli

Rule Path
Disallow /

psbot

Rule Path
Disallow /

turnitinbot

Rule Path
Disallow /

*

Rule Path
Disallow /images_*/2ai/
Disallow /restricted/
Disallow /*/restricted/
Disallow /r/
Disallow /*/r/
Disallow /scripts/
Disallow /*/scripts/
Disallow /tab/
Disallow /*/tab/
Disallow /pdf/tab/
Disallow /*/pdf/tab/
Disallow /*/pdf-en/
Disallow /cache_*/
Disallow /pdf/*/Show/
Disallow /*/pdf/*/Show/
Disallow /pdf/incat/
Disallow /*/pdf/incat/
Disallow /pdf/incatsoc/
Disallow /*/pdf/incatsoc/
Disallow /*favicon.ico
Disallow /*.pdf$
Disallow /pdf-en/
Disallow /ajax/
Disallow /*/ajax/
Disallow /static/ressources/
Disallow /*/static/ressources/
Disallow /*.json$
Disallow /request*$
Disallow /*/request*$
Disallow /images/*$
Disallow /localization/country/list.html$
Disallow /*/localization/country/list.html
Disallow /*?*
Disallow /myspace/
Disallow /*/myspace/
Disallow /tracking/*
Disallow /*/images_*/2ai/
Disallow /*/images/*
Disallow /*/tracking/*
Disallow /pdf/*-_*.html
Disallow /*/pdf/*-_*.html
Disallow /discover-us/thank-you.html
Disallow /newsletter/
Disallow /jsErrorHandler
Disallow /compare.html
Disallow /*/compare.html
Disallow /prod2/
Disallow /*/prod2/
Disallow /rfq/
Disallow /mailing/*
Disallow /*/mailing/*
Disallow /viewerCatalog/
Disallow /viewerCatalog-en/