gpdp.it
robots.txt

Robots Exclusion Standard data for gpdp.it

Resource Scan

Scan Details

Site Domain gpdp.it
Base Domain gpdp.it
Scan Status Failed
Failure StageFetching resource.
Failure ReasonCouldn't connect to server.
Last Scan2024-06-25T08:51:07+00:00
Next Scan 2024-07-25T08:51:07+00:00

Last Successful Scan

Scanned2024-05-04T08:50:16+00:00
URL https://gpdp.it/robots.txt
Domain IPs 15.160.73.215, 15.161.156.80
Response IP 15.160.73.215
Found Yes
Hash 884fcbc0d72027c3b8ed387dea2aa4bb5db61d8972d702c676686c2fef9a25d8
SimHash 840d5d7ce7de

Groups

*

Rule Path
Disallow /web/guest/pdf
Disallow /*pdf*
Disallow /*printPDF*
Disallow /*.pdf$
Disallow /pdf
Disallow /*996831$
Disallow /*Bollettino%2Bn.%2B20%2B-%2BMaggio%2B2001.pdf$
Disallow /*996849$
Disallow /*Bollettino%2Bn.%2B16%2B-%2BGennaio%2B2001.pdf$
Disallow /*996900$
Disallow /*Bollettino%2Bn.%2B6%2B-%2BSettembre-Dicembre%2B1998.pdf$
Disallow /*1456423$
Disallow /*1456423$
Disallow /*996831$
Disallow /*996849$
Disallow /*996900$
Disallow /*1456423$
Disallow /*9026818$
Disallow /c/portal/*
Disallow */control_panel/manage*

googlebot

Rule Path
Disallow /c/portal/*
Disallow */control_panel/manage*

bingbot

Rule Path
Disallow /c/portal/*
Disallow */control_panel/manage*

baiduspider

Rule Path
Disallow /c/portal/*
Disallow */control_panel/manage*

twitterbot

Rule Path
Disallow /c/portal/*
Disallow */control_panel/manage*