gencat.cat
robots.txt
Robots Exclusion Standard data for gencat.cat
Resource Scan
Scan Details
Site Domain | gencat.cat |
Base Domain | gencat.cat |
Scan Status | Ok |
Last Scan | 2024-11-01T11:23:37+00:00 |
Next Scan | 2024-12-01T11:23:37+00:00 |
Last Scan
Scanned | 2024-11-01T11:23:37+00:00 |
URL | https://gencat.cat/robots.txt |
Domain IPs | 83.247.151.41 |
Response IP | 83.247.151.41 |
Found | Yes |
Hash | a74452c448f9116936b1fddb3732806dbc8d704d0813a4352cd7a33995de2e48 |
SimHash | d00c956a3782 |
Groups
*
Rule | Path |
---|---|
Disallow | /mediamb/pn/espais |
Disallow | /eadop/imatges/ |
Disallow | /eadop/imagenes/ |
Disallow | /vtls |
Disallow | /cgi-bin |
Disallow | /diari/ |
Disallow | /diari_c/ |
Disallow | /treball/doc/doc_44220496_1.pdf |