ccm.it
robots.txt

Robots Exclusion Standard data for ccm.it

Resource Scan

Scan Details

Site Domain ccm.it
Base Domain ccm.it
Scan Status Ok
Last Scan2025-07-23T03:17:13+00:00
Next Scan 2025-08-22T03:17:13+00:00

Last Scan

Scanned2025-07-23T03:17:13+00:00
URL https://www.ccm.it/robots.txt
Domain IPs 20.50.2.41
Response IP 20.50.2.41
Found Yes
Hash 23a8a5a3fd78adbc0ebd9ded6dc37e0a98c0b1742d91adb340989e9d22e38fb4
SimHash 033b14450370

Groups

ahrefsbot
blp_bbot
businessdbbot
ccbot
covarioids
converacrawler
curl/
discobot
download ninja
email exractor
ezooms
fdm 3.x
flaxcrawler
grabber
grapeshot
gslfbot
heritrix
httrack
intelium_bot
istellabot
java/
larbin
lemurwebcrawler
libwww-perl
metamojicrawler
mj12bot
nutch
openacoon
php/
plukkie
proximic
python-urllib
ruby
semrushbot
seokicks
spbot
turnitinbot
yandexbot
wbsearchbot
weblexbot
wget
wire/0.
zyborg

Rule Path
Disallow /

msnbot/bingbot
bingbot
msnbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 120

*

Rule Path
Disallow /*.wgx$
Disallow /AdminCMS
Disallow /admincms
Disallow /BatchCMS
Disallow /batchcms
Disallow /Admin/
Disallow /admin/
Disallow /Auth/
Disallow /Auth$
Disallow /auth/
Disallow /Batch/
Disallow /batch/
Disallow /Custom/
Disallow /custom/
Disallow /Tmp/
Disallow /tmp/
Disallow /LazyLogin
Disallow /lazylogin
Disallow /ShowReel
Disallow /showreel
Disallow /Search/SearchCMS
Disallow /search/searchcms
Disallow /Printing/
Disallow /printing/
Disallow /Custom/RenderAsPDF
Disallow /custom/renderaspdf
Disallow /LazyLogin/HitLog
Disallow /lazylogin/hitlog
Disallow /*%3Dprint%3Dtrue
Disallow /*%26ListMode%3D*
Disallow /*%26listmode%3D*
Disallow /*?ListMode=*
Disallow /*?listmode=*
Disallow /*%26SortMode%3D*
Disallow /*%26sortmode%3D*
Disallow /*?SortMode=*
Disallow /*?sortmode=*
Disallow /AreaPersonale

Other Records

Field Value
sitemap http://consorzioculturaledelmonfalconese.site/SitemapCms

Warnings

  • 1 invalid line.