pgol.it
robots.txt

Robots Exclusion Standard data for pgol.it

Resource Scan

Scan Details

Site Domain pgol.it
Base Domain pgol.it
Scan Status Ok
Last Scan2024-08-24T09:07:32+00:00
Next Scan 2024-09-23T09:07:32+00:00

Last Scan

Scanned2024-08-24T09:07:32+00:00
URL http://pgol.it/robots.txt
Redirect https://www.paginegialle.it/robots.txt
Redirect Domain www.paginegialle.it
Redirect Base paginegialle.it
Domain IPs 213.209.19.251
Redirect IPs 13.226.2.41, 13.226.2.67, 13.226.2.81, 13.226.2.91, 2600:9000:21f8:1400:18:2d66:e480:93a1, 2600:9000:21f8:1e00:18:2d66:e480:93a1, 2600:9000:21f8:5a00:18:2d66:e480:93a1, 2600:9000:21f8:9800:18:2d66:e480:93a1, 2600:9000:21f8:a200:18:2d66:e480:93a1, 2600:9000:21f8:b800:18:2d66:e480:93a1, 2600:9000:21f8:e400:18:2d66:e480:93a1, 2600:9000:21f8:e600:18:2d66:e480:93a1
Response IP 18.165.171.90
Found Yes
Hash 455d693277592977592635540748d8fd163813b5f4cec1ce0e82cbe5904f8653
SimHash 861efd6eadf2

Groups

*

Rule Path
Disallow /ricerca/
Disallow /profilo/
Disallow /deu/
Disallow /pgol/
Disallow /pg/cgi/
Disallow /pgolfe/
Disallow /info/*.html
Disallow /mappa/
Disallow /vcard*
Disallow *rk%3D*
Disallow *mr%3D*
Disallow *f%3D*
Disallow *sort%3D*
Disallow *addr%3D*
Disallow *filtro%3D*
Disallow *abt%3D*
Disallow *sede%3D*
Disallow */elencosedi/*/mappa
Disallow /*/preview

petalbot

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

dotbot/1.0

Rule Path
Disallow /

baiduspider

Rule Path
Disallow /

yandexbot

Rule Path
Disallow /

megaindex.ru

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.paginegialle.it/sitemap.xml
sitemap https://www.paginegialle.it/sitemap_fe.xml

Comments

  • robots file for SEAT Pagine Gialle