peninsulacellars.com
robots.txt

Robots Exclusion Standard data for peninsulacellars.com

Resource Scan

Scan Details

Site Domain peninsulacellars.com
Base Domain peninsulacellars.com
Scan Status Ok
Last Scan2025-05-07T18:27:00+00:00
Next Scan 2025-06-06T18:27:00+00:00

Last Scan

Scanned2025-05-07T18:27:00+00:00
URL https://peninsulacellars.com/robots.txt
Redirect https://www.peninsulacellars.com/robots.txt
Redirect Domain www.peninsulacellars.com
Redirect Base peninsulacellars.com
Domain IPs 34.235.25.248
Redirect IPs 34.235.25.248
Response IP 34.235.25.248
Found Yes
Hash 3939713797ae6f62d7d95ac3a0056c0f38431723ab366eabb1066a91739abff4
SimHash a9000e421370

Groups

*

Rule Path
Allow /
Disallow /trackback/
Disallow /wp-admin/
Disallow /wp-content/
Disallow /wp-includes/
Disallow /xmlrpc.php
Disallow /wp-
Disallow /cgi-bin
Disallow /readme.html
Disallow /license.txt
Disallow /*?*
Disallow /*.php$
Disallow /*.js$
Disallow /*.inc$
Disallow /*.css$
Disallow /*.gz$
Disallow /*.wmv$
Disallow /*.cgi$
Disallow /*.xhtml$
Disallow /*/wp-*
Disallow /*/feed/*
Disallow /*/*?s=*
Disallow /*/*.js$
Disallow /*/*.inc$
Allow /wp-content/uploads/

ia_archiver*

Rule Path
Disallow /

duggmirror

Rule Path
Disallow /

Other Records

Field Value
sitemap http://peninsulacellars.com/sitemap_index.xml

Comments

  • robots.txt from https://gist.github.com/andrewryno/5148255
  • Robots Rule! - Sometimes...
  • Disallow these directories, url types & file-types