
Robots Exclusion Standard data for giallorossi.net

Resource Scan

Scan Details

Site Domain giallorossi.net
Base Domain giallorossi.net
Scan Status Ok
Last Scan 2024-11-14T04:23:51+00:00
Next Scan 2024-11-21T04:23:51+00:00
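
The two timestamps above imply a rescan interval, which can be checked with a short sketch. It only parses the values shown in the scan details; the variable names are illustrative and not part of the report.

```python
from datetime import datetime, timedelta

# Timestamps copied from the scan details above (ISO 8601 with UTC offset).
last_scan = datetime.fromisoformat("2024-11-14T04:23:51+00:00")
next_scan = datetime.fromisoformat("2024-11-21T04:23:51+00:00")

# Interval between the last scan and the scheduled next scan.
interval = next_scan - last_scan
print(interval.days)
```

For these two values the interval is exactly seven days. Whether the scanner always uses a weekly schedule is not stated in the report.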

Last Scan

Scanned 2024-11-14T04:23:51+00:00
URL https://giallorossi.net/robots.txt
Domain IPs 185.81.0.25
Response IP 185.81.0.25
Found Yes
Hash 052b1c9282f9de4dbe39bcd17797390801414c12ad3fef25e8e9be6cb7d8b55d
SimHash 43555a0246f5
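
The Hash value is 64 hexadecimal characters, which is consistent with a SHA-256 digest. The report does not name the algorithm or say which bytes are hashed (raw response body or a normalized form), so the following is only a sketch under that assumption. The function name `fingerprint` is hypothetical.

```python
import hashlib

def fingerprint(body: bytes) -> str:
    """Return a hex digest of a robots.txt body.

    Assumption: SHA-256 over the raw bytes. The scan report does not
    confirm either the algorithm or the input normalization.
    """
    return hashlib.sha256(body).hexdigest()

# Example with a made-up two-line body, not the real file contents.
sample = b"User-agent: *\nAllow: /\n"
digest = fingerprint(sample)
print(len(digest))
```

Comparing such a digest between scans is a cheap way to detect that the file changed at all. A similarity hash such as the SimHash field would instead be used to judge how much it changed.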

Groups

*

Rule Path
Allow /
Disallow /cgi-bin
Disallow /wp-admin
Disallow /e/
Disallow /show-error-*
Disallow /xmlrpc.php
Disallow /trackback/

turnitinbot

Rule Path
Disallow /

npbot-1/2.0

Rule Path
Disallow /

npbot

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

seznambot

Rule Path
Disallow /

sogou

Rule Path
Disallow /

spinn3r

Rule Path
Disallow /

coccoc

Rule Path
Disallow /

exabot

Rule Path
Disallow /

ezooms

Rule Path
Disallow /

fatbot

Rule Path
Disallow /

jikespider

Rule Path
Disallow /

psbot

Rule Path
Disallow /

sitebot

Rule Path
Disallow /

sosospider

Rule Path
Disallow /

voilabot

Rule Path
Disallow /

willybot

Rule Path
Disallow /

yodaobot

Rule Path
Disallow /

germcrawler

Rule Path
Disallow /

huaweisymantecspider

Rule Path
Disallow /

offline explorer

Rule Path
Disallow /

sitesnagger

Rule Path
Disallow /

teleportpro

Rule Path
Disallow /

webcopier

Rule Path
Disallow /

webreaper

Rule Path
Disallow /

webstripper

Rule Path
Disallow /

webzip

Rule Path
Disallow /

xaldon_webspider

Rule Path
Disallow /
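
The "*" group lists `Allow /` before its more specific `Disallow` rules, so the outcome depends on how a crawler resolves overlapping rules. The sketch below evaluates one group using the longest-match precedence described in RFC 9309, with `*` as a wildcard and `$` as an end anchor. It is an illustration, not the scanner's or any particular crawler's implementation; `STAR_RULES`, `BLOCKED_RULES`, `pattern_to_regex`, and `is_allowed` are hypothetical names, and parsers that apply the first matching rule in file order can give different answers.

```python
import re

# Rules of the "*" group, in the order listed in the scan.
STAR_RULES = [
    ("allow", "/"),
    ("disallow", "/cgi-bin"),
    ("disallow", "/wp-admin"),
    ("disallow", "/e/"),
    ("disallow", "/show-error-*"),
    ("disallow", "/xmlrpc.php"),
    ("disallow", "/trackback/"),
]

# Every named bot group in the scan carries this single rule.
BLOCKED_RULES = [("disallow", "/")]

def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled prefix regex."""
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # "*" matches any run of characters; everything else is literal.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))

def is_allowed(path, rules):
    """Longest matching pattern wins; on a length tie, allow wins."""
    best_len = -1
    best_allow = True  # no matching rule means the path is allowed
    for kind, pattern in rules:
        if pattern_to_regex(pattern).match(path):
            allow = kind == "allow"
            if len(pattern) > best_len or (len(pattern) == best_len and allow):
                best_len, best_allow = len(pattern), allow
    return best_allow

print(is_allowed("/wp-admin/index.php", STAR_RULES))
print(is_allowed("/news/article", STAR_RULES))
print(is_allowed("/news/article", BLOCKED_RULES))
```

Under this precedence, `/wp-admin/index.php` is blocked for the "*" group because `/wp-admin` is a longer match than `/`, while ordinary article paths stay allowed. Any group that contains only `Disallow /` blocks every path for that user agent.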

Other Records

Field Value
sitemap https://www.giallorossi.net/sitemap.xml
sitemap https://www.giallorossi.net/sitemap-news.xml