annuncianimali.it
robots.txt

Robots Exclusion Standard data for annuncianimali.it

Resource Scan

Scan Details

Site Domain annuncianimali.it
Base Domain annuncianimali.it
Scan Status Ok
Last Scan 2024-05-14T11:04:02+00:00
Next Scan 2024-05-21T11:04:02+00:00

Last Scan

Scanned 2024-05-14T11:04:02+00:00
URL https://annuncianimali.it/robots.txt
Redirect https://www.annuncianimali.it/robots.txt
Redirect Domain www.annuncianimali.it
Redirect Base annuncianimali.it
Domain IPs 104.26.4.160, 104.26.5.160, 172.67.72.68, 2606:4700:20::681a:4a0, 2606:4700:20::681a:5a0, 2606:4700:20::ac43:4844
Redirect IPs 104.26.4.160, 104.26.5.160, 172.67.72.68, 2606:4700:20::681a:4a0, 2606:4700:20::681a:5a0, 2606:4700:20::ac43:4844
Response IP 104.26.5.160
Found Yes
Hash a60f35f5fdc8431f403fb25515c31156a318f2af39e10b586b930eba936cad76
SimHash 7a43fb4563b8
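
The Hash value above is 64 hexadecimal characters long, which is consistent with a SHA-256 digest, although the report does not name the algorithm. Assuming SHA-256 over the raw response body, a later fetch could be compared against the recorded value roughly as follows. The helper names sha256_hex and body_unchanged are hypothetical, not part of the scanner.

```python
import hashlib


def sha256_hex(body: bytes) -> str:
    """Return the SHA-256 digest of the raw bytes as lowercase hex."""
    return hashlib.sha256(body).hexdigest()


def body_unchanged(body: bytes, recorded_hash: str) -> bool:
    """Compare a freshly fetched body against a previously recorded digest.

    Assumption: the recorded hash is SHA-256 of the unmodified response body.
    The report itself does not state which algorithm it uses.
    """
    return sha256_hex(body) == recorded_hash.strip().lower()
```

If the digests differ even though the file looks the same, the scanner may hash a normalized form of the body or use a different algorithm; the sketch cannot tell these cases apart.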

Groups

ias_crawler

Rule Path
Disallow

ias_wombles

Rule Path
Disallow

*

Rule Path
Disallow /api/*
Disallow /registrazione/cambia-email/*/
Disallow /registrazione/conferma-email/*/
Disallow /registrazione/controlla-email/*
Disallow /annunci/modifica/*
Disallow /annunci/anteprima/*
Disallow /annunci/evidenza/*
Disallow /vas/checkout/*
Disallow /annunci/nuovo/*
Disallow /create-new-listing/*
Disallow /form-engine-playground/*
Disallow /chat/*
Disallow /*.pdf$
Disallow /*?ordinare=
Disallow /*?sesso=
Disallow /*?dataDiNascita=
Disallow /*?condizione=
Disallow /*?assistenza=
Allow /cani-razze/?page=*
Allow /gatti-razze/?page=*
Allow */*/?category=*
Allow */*/?scade=true
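
Several of the rules above rely on the two special characters defined for robots.txt paths: * matches any run of characters and a trailing $ anchors the pattern to the end of the URL. The following is a minimal sketch of how a crawler might evaluate such rules, assuming the longest-match precedence described in RFC 9309 (the longest matching pattern wins, and Allow wins a tie). It uses only a subset of the rules listed above, and the function names and the sample paths in the usage note are hypothetical.

```python
import re

# A subset of the rules from the "*" group above, as (kind, pattern) pairs.
RULES = [
    ("disallow", "/api/*"),
    ("disallow", "/*.pdf$"),
    ("disallow", "/*?ordinare="),
    ("allow", "/cani-razze/?page=*"),
]


def pattern_to_regex(pattern: str) -> re.Pattern:
    """Translate a robots.txt path pattern into a compiled regex.

    '*' becomes '.*'; a trailing '$' anchors the match at the end of the path.
    Every other character is matched literally.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + (r"\Z" if anchored else ""))


def is_allowed(path: str, rules=RULES) -> bool:
    """Return True if the path (including any query string) may be crawled."""
    best = None  # (pattern length, is_allow) of the most specific match so far
    for kind, pattern in rules:
        if not pattern:
            continue  # an empty pattern matches nothing
        if pattern_to_regex(pattern).match(path):
            candidate = (len(pattern), kind == "allow")
            # Longer patterns win; on equal length, allow (True) beats disallow.
            if best is None or candidate > best:
                best = candidate
    return True if best is None else best[1]
```

With these rules, is_allowed("/api/v1/listings") and is_allowed("/docs/guide.pdf") return False, while is_allowed("/cani-razze/?page=2") and is_allowed("/docs/guide.pdf?x=1") return True. The last case shows the effect of the $ anchor: the path no longer ends in .pdf once a query string is appended. Note that Python's built-in urllib.robotparser does not interpret * or $ inside paths, which is why the sketch does its own matching.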

Other Records

Field Value
sitemap https://www.annuncianimali.it/sitemaps/sitemap_index.xml

Warnings

  • 1 invalid line.