vroc.it
robots.txt

Robots Exclusion Standard data for vroc.it

Resource Scan

Scan Details

Site Domain vroc.it
Base Domain vroc.it
Scan Status Ok
Last Scan2025-07-14T10:31:32+00:00
Next Scan 2025-08-13T10:31:32+00:00

Last Scan

Scanned2025-07-14T10:31:32+00:00
URL https://vroc.it/robots.txt
Redirect https://www.vroc.it/robots.txt
Redirect Domain www.vroc.it
Redirect Base vroc.it
Domain IPs 31.11.36.12
Redirect IPs 31.11.36.12
Response IP 31.11.36.12
Found Yes
Hash 22070f8a1c35a138128942a1d9b184b7ef6e52019c755ba02660dfff81a2a081
SimHash a33af1b4691b

Groups

*

Rule Path
Disallow /secret/
Disallow /administrator/
Disallow /aiuti/
Disallow /anniversario/
Disallow /auguri/
Disallow /cache/
Disallow /cgi-bin/
Disallow /dmdocuments/
Disallow /editor/
Disallow /help/
Disallow /includes/
Disallow /jukebox/
Disallow /language/
Disallow /mambots/
Disallow /media/
Disallow /newsletter/
Disallow /patch/
Disallow /prenotazioni/
Disallow /spille/
Disallow /tecnica/
Disallow /testata/

naverbot
yeti

Rule Path
Disallow /

baiduspider
baiduspider-video
baiduspider-image
sogou spider
youdaobot

Rule Path
Disallow /

yandex

Rule Path
Disallow /

Comments

  • Disallow: /associazione/
  • Disallow: /components/
  • Disallow: /images/
  • Disallow: /modules/
  • Disallow: /templates/
  • MOTORI DI RICERCA KOREANI
  • MOTORI DI RICERCA CINESI
  • MOTORI DI RICERCA RUSSI

Warnings

  • 2 invalid lines.