Robots Exclusion Standard data for willhull.com

Resource Scan

Scan Details

Site Domain willhull.com
Base Domain willhull.com
Scan Status Ok
Last Scan 2024-06-01T21:24:40+00:00
Next Scan 2024-07-01T21:24:40+00:00

Last Scan

Scanned 2024-06-01T21:24:40+00:00
URL https://willhull.com/robots.txt
Domain IPs 198.46.93.206
Response IP 198.46.93.206
Found Yes
Hash 6f5fcefb9d08b219a5de09f77a1f29b426d1008f0cda7b50edd3324f4c2f9e14
SimHash 057947a55750
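
The Hash field is 64 hexadecimal digits, which is consistent with a SHA-256 digest, though the scan data does not name the algorithm. Under that assumption, a minimal sketch of how a stored hash could be used to detect a changed robots.txt between scans might look like the following. The function names `body_fingerprint` and `has_changed` are hypothetical, and whether the scanner hashes the raw response bytes is also an assumption.

```python
import hashlib


def body_fingerprint(body: bytes) -> str:
    """Return a hex digest of the fetched robots.txt body.

    Assumption: SHA-256 over the raw bytes. The scan report only shows
    a 64-hex-digit value and does not say how it was computed.
    """
    return hashlib.sha256(body).hexdigest()


def has_changed(previous_hash: str, body: bytes) -> bool:
    """Compare a previously stored digest against a freshly fetched body."""
    return body_fingerprint(body) != previous_hash
```

An exact digest like this only answers "identical or not"; the separate SimHash field presumably serves near-duplicate comparison, which this sketch does not attempt.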

Groups

*

Rule Path
Allow /
Disallow /blog/cgi-bin/
Disallow /cgi-bin/
Disallow /demo
Disallow /swf
Disallow /blog/wp-admin
Disallow /blog/wp-admin/*
Disallow /blog/wp-includes
Disallow /blog/wp-includes/*
Disallow /blog/wp-content
Disallow /blog/wp-content/cache
Disallow /blog/show-error-*
Disallow /blog/xmlrpc.php
Disallow /blog/trackback/
Disallow /blog/comment-page-
Disallow /.well-known/*
Disallow /about.html
Disallow /feedflare.xml
Disallow /resume.html
Disallow /thanks.html
Disallow /portfolio.html
Disallow /contact.html
Allow /images/*
Allow /js/*
Allow /docs/*
Allow /jQuery/*
Allow /swf/*
Allow /photogallery/*
Allow /demo/playball/*
Allow /wp-includes/js/*
Allow /blog/wp-content/uploads/*
Allow /blog/wp-content/themes/WHull_Blog_Update/*
Allow /blog/wp-content/cache/*
Allow /blog/wp-content/plugins/*
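
Several rules in this group overlap, for example `Disallow /swf` with `Allow /swf/*`, and `Disallow /demo` with `Allow /demo/playball/*`. The sketch below shows one way such conflicts can be resolved, assuming the longest-match precedence described in RFC 9309 (the rule with the longest matching path wins, and Allow wins a tie). It uses only a subset of the rules listed above; the helper names `pattern_to_regex` and `is_allowed` are hypothetical, and individual crawlers may evaluate rules differently.

```python
import re

# A subset of the "*" group rules listed above, as (rule, path) pairs.
RULES = [
    ("allow", "/"),
    ("disallow", "/demo"),
    ("disallow", "/swf"),
    ("disallow", "/blog/wp-content"),
    ("disallow", "/about.html"),
    ("allow", "/swf/*"),
    ("allow", "/demo/playball/*"),
    ("allow", "/blog/wp-content/uploads/*"),
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    "*" matches any run of characters; a trailing "$" anchors the end.
    Everything else is matched literally as a prefix of the URL path.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(path, rules=RULES):
    """Apply longest-match precedence; Allow wins when lengths tie.

    A path that matches no rule at all is treated as allowed.
    """
    best_len, best_rule = -1, "allow"
    for rule, pattern in rules:
        if pattern_to_regex(pattern).match(path):
            length = len(pattern)
            if length > best_len or (length == best_len and rule == "allow"):
                best_len, best_rule = length, rule
    return best_rule == "allow"
```

Under these assumptions, `is_allowed("/swf")` is false while `is_allowed("/swf/movie.swf")` is true, because `/swf/*` is a longer match than `/swf`. Likewise `/demo/other` is blocked while `/demo/playball/index.html` is permitted.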

mediapartners-google

Rule Path
Allow /

adsbot-google

Rule Path
Allow /

googlebot-image

Rule Path
Allow /

googlebot-mobile

Rule Path
Allow /

googlebot-video

Rule Path
Allow /

googlebot-news

Rule Path
Allow /

googlebot

Rule Path
Allow /

Other Records

Field Value
sitemap https://willhull.com/sitemap_index.xml