lavoratorio.it
robots.txt

Robots Exclusion Standard data for lavoratorio.it

Resource Scan

Scan Details

Site Domain lavoratorio.it
Base Domain lavoratorio.it
Scan Status Failed
Failure Reason Scan timed out.
Last Scan 2024-10-04T22:08:31+00:00
Next Scan 2024-10-11T22:08:31+00:00

Last Successful Scan

Scanned 2024-09-26T21:58:28+00:00
URL https://lavoratorio.it/robots.txt
Domain IPs 46.28.3.137
Response IP 46.28.3.137
Found Yes
Hash f22d7bba9f362d123765ac64726b110774127498a8aeb2e40eadc8768f680b47
SimHash 6dcc1dd6ea80

Groups

*

Rule Path
Disallow /acquisizionelavoro-filelavoro.php
Disallow /bachecalavoro.php
Disallow /biglist.php
Disallow /indeed-needs.php
Disallow /jobcrawler.php
Disallow /jobisjob-lavoraresempre.php
Disallow /jobrapido_text.php
Disallow /mitula-latumi.php
Disallow /modena-screenmedia.php
Disallow /mrlavoro-lavoromr.php
Disallow /olx-comeolx.php
Disallow /renego-gonere.php
Disallow /simplyhired-noteasy.php
Disallow /trovit.php
Disallow /tuttoannunci.php
Disallow /yakaz-zaccaria.php
Disallow /stampa.php
Disallow /showpic.php
Disallow /anteprimaannuncio.php
Disallow /anteprimaarticolo.php
Disallow /anteprimalettera.php
Disallow /anteprimadocumento.php
Disallow /anteprimaeditoriale.php
Disallow /_gestione-contenuti/
Disallow /xml-motori/
Disallow /xml-screenmedia/
Disallow /admin/
Disallow /annunci/
Disallow /social-bookmarking.php
Disallow /crawler/
Disallow /ajs/
Disallow /services/
Disallow /api/
Disallow /temp/
Disallow /temp/send

Other Records

Field Value
crawl-delay 20

ubicrawler

Rule Path
Disallow /

grapeshot

Rule Path
Disallow

ahrefsbot

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

istellabot

Rule Path
Disallow /

exaudi crawler

Rule Path
Disallow /

scalaj-http

Rule Path
Disallow /

linkbot

Rule Path
Disallow /

zoombot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.lavoratorio.it/Sitemap.xml
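How a crawler might evaluate these groups can be sketched with Python's standard urllib.robotparser module. This is only an illustration: the robots.txt text below is a hand-written subset of the rules listed above, the "User-agent:" / "Disallow:" spelling is assumed (the report shows normalized field names only), and "examplebot" is a hypothetical crawler name that matches no named group.

```python
from urllib.robotparser import RobotFileParser

# Hand-written subset of the scanned rules (assumed layout, not the
# original file). "examplebot" below is a hypothetical user agent.
ROBOTS_TXT = """\
User-agent: *
Disallow: /admin/
Disallow: /api/
Disallow: /temp/
Crawl-delay: 20

User-agent: grapeshot
Disallow:

User-agent: ahrefsbot
Disallow: /

Sitemap: https://www.lavoratorio.it/Sitemap.xml
"""

BASE = "https://lavoratorio.it"

parser = RobotFileParser()
# parse() takes an iterable of lines, so no network access is needed.
parser.parse(ROBOTS_TXT.splitlines())


def allowed(agent: str, path: str) -> bool:
    """Return True if `agent` may fetch `path` under the rules above."""
    return parser.can_fetch(agent, BASE + path)


if __name__ == "__main__":
    checks = [
        ("examplebot", "/"),        # falls back to the * group
        ("examplebot", "/admin/"),  # disallowed by the * group
        ("grapeshot", "/admin/"),   # empty Disallow: nothing is blocked
        ("ahrefsbot", "/"),         # Disallow / blocks the whole site
    ]
    for agent, path in checks:
        print(agent, path, allowed(agent, path))
    # Crawl-delay is read from the group that applies to the agent.
    print("crawl-delay:", parser.crawl_delay("examplebot"))
    # site_maps() is available in Python 3.8 and later.
    print("sitemaps:", parser.site_maps())
```

Note the grapeshot group: a Disallow record with an empty path blocks nothing, so that agent is not bound by the * group's rules, while agents such as ahrefsbot with "Disallow /" are excluded from the whole site.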