marcianise.info
robots.txt

Robots Exclusion Standard data for marcianise.info

Resource Scan

Scan Details

Site Domain marcianise.info
Base Domain marcianise.info
Scan Status Ok
Last Scan 2024-11-02T05:23:35+00:00
Next Scan 2024-11-09T05:23:35+00:00

Last Scan

Scanned 2024-11-02T05:23:35+00:00
URL https://marcianise.info/robots.txt
Redirect https://www.marcianise.info/robots.txt
Redirect Domain www.marcianise.info
Redirect Base marcianise.info
Domain IPs 2001:4b78:1001::6601, 217.64.195.178
Redirect IPs 2001:4b78:1001::6601, 217.64.195.178
Response IP 217.64.195.178
Found Yes
Hash 55c3e8a414deeb43bdc08a7bcdfd9a018620e99731462eb63d706c24f32b0aa5
SimHash 6a41d882c6e0
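The Hash field above is 64 hexadecimal digits, which is consistent with a SHA-256 digest of the fetched file, although the scan does not name the algorithm. Assuming SHA-256, a minimal sketch of how such a digest can be used to detect changes between scans might look like the following. The function names `body_digest` and `has_changed` are hypothetical and not part of the scan report.

```python
import hashlib


def body_digest(body: bytes) -> str:
    """Return the SHA-256 hex digest of a robots.txt body.

    Assumption: the scanner's Hash field is SHA-256; the report does not say so.
    """
    return hashlib.sha256(body).hexdigest()


def has_changed(body: bytes, previous_digest: str) -> bool:
    """Compare a freshly fetched body against the digest stored from the last scan."""
    return body_digest(body) != previous_digest
```

An exact digest changes on any byte difference, which is presumably why the report also lists a SimHash, a fuzzy fingerprint meant to stay similar under small edits.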

Groups

facebookexternalhit
googlebot
googlebot-news
googlebot-image
googlebot-mobile
mediapartners-google
mediapartners
webnews arianna
grapeshot

Rule Path
Disallow (empty)
Disallow /wp-admin/

*

Rule Path
Disallow /

Other Records

Field Value
crawl-delay 10
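To make the effect of the two groups concrete, here is a minimal sketch that evaluates the rules exactly as listed in this scan. It is not a general robots.txt parser: it assumes exact, case-insensitive matching of the user-agent token and plain prefix matching of paths, and the names `NAMED_AGENTS`, `NAMED_DISALLOW`, `DEFAULT_DISALLOW`, and `is_allowed` are illustrative only.

```python
# User agents named in the first group of the scan.
NAMED_AGENTS = {
    "facebookexternalhit", "googlebot", "googlebot-news", "googlebot-image",
    "googlebot-mobile", "mediapartners-google", "mediapartners",
    "webnews arianna", "grapeshot",
}

# Disallow paths for the named group; the empty entry blocks nothing.
NAMED_DISALLOW = ["", "/wp-admin/"]

# Disallow paths for the "*" group, which applies to every other crawler.
DEFAULT_DISALLOW = ["/"]

# Crawl-delay (seconds) recorded for the "*" group.
DEFAULT_CRAWL_DELAY = 10


def is_allowed(agent: str, path: str) -> bool:
    """Return True if `agent` may fetch `path` under the scanned rules.

    Simplification: both groups contain only Disallow rules, so any
    non-empty rule that is a prefix of the path blocks the request.
    """
    rules = NAMED_DISALLOW if agent.lower() in NAMED_AGENTS else DEFAULT_DISALLOW
    return not any(rule and path.startswith(rule) for rule in rules)
```

Under these assumptions, a named crawler such as googlebot may fetch everything except paths under /wp-admin/, while any crawler not named in the first group is blocked from the whole site.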

Other Records

Field Value
sitemap http://www.marcianise.info/sitemap.xml.gz

Comments

  • USER AGENTS ALLOWED everywhere except the wp-admin folder
  • User-agent: Googlebot-Video
  • User-agent: AdsBot-Google (used for AdWords)
  • User-agent: Bingbot
  • User-agent: MSNBot
  • User-agent: MSNBot-Media
  • User-agent: BingPreview
  • User-agent: Slurp
  • Allow: /
  • Request-rate: 1/10
  • Visit-time: 2200-0600