perfectclean.ie
robots.txt

Robots Exclusion Standard data for perfectclean.ie

Resource Scan

Scan Details

Site Domain perfectclean.ie
Base Domain perfectclean.ie
Scan Status Ok
Last Scan 2024-11-17T10:28:59+00:00
Next Scan 2024-12-17T10:28:59+00:00

Last Scan

Scanned 2024-11-17T10:28:59+00:00
URL https://perfectclean.ie/robots.txt
Domain IPs 13.200.123.229, 13.234.100.116, 65.0.79.182
Response IP 13.234.100.116
Found Yes
Hash 7992c4d1f62262eea78ea5d9d4db0aa4eebdf266cc2f412a46050bfc4365300c
SimHash 6f22c8c10681

Groups

*

Rule Path
Disallow /wp-admin/
Disallow /wp-includes/
Disallow /wp-content/plugins/
Disallow /wp-content/themes/
Disallow /readme.html
Disallow */trackback
Disallow /xmlrpc.php
Disallow /*/feed
Disallow /*/comments
Disallow /*.php$
Disallow /*?
Disallow /blackhole/
Disallow /wp-login.php
Disallow /search/
Disallow %26p%3D
Disallow %26preview%3D
Disallow /tag/
Disallow /404/
Disallow /500/
Disallow /404-error/
Allow /?utm
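
To illustrate how a crawler might evaluate the rules in this group, here is a minimal Python sketch. It assumes the commonly used matching convention: `*` matches any run of characters, a trailing `$` anchors the end of the path, the longest matching pattern wins, and Allow wins a tie. The names `RULES`, `pattern_to_regex` and `is_allowed` are hypothetical, and only a subset of the rules above is included. Real crawlers may differ in the details.

```python
import re

# A subset of the rules listed for the "*" group, as (directive, path) pairs.
RULES = [
    ("disallow", "/wp-admin/"),
    ("disallow", "/*/feed"),
    ("disallow", "/*.php$"),
    ("disallow", "/*?"),
    ("disallow", "/tag/"),
    ("allow", "/?utm"),
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    '*' becomes '.*'; a trailing '$' anchors the end of the path.
    Everything else is matched literally, starting at the path's beginning.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile("^" + body + ("$" if anchored else ""))


def is_allowed(path, rules=RULES):
    """Return True if `path` (path plus query string) may be fetched.

    Assumption: the longest matching pattern decides, and Allow wins
    when an Allow and a Disallow pattern have the same length.
    A path that matches no rule is allowed.
    """
    best_len = -1
    allowed = True
    for directive, pattern in rules:
        if pattern_to_regex(pattern).match(path):
            is_allow = directive == "allow"
            if len(pattern) > best_len or (len(pattern) == best_len and is_allow):
                best_len = len(pattern)
                allowed = is_allow
    return allowed
```

Under these assumptions, `is_allowed("/?utm_source=newsletter")` is true because `Allow /?utm` is longer than `Disallow /*?`, while `is_allowed("/?s=test")` is false because only the Disallow pattern matches.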

Other Records

Field Value
crawl-delay 10
sitemap https://perfectclean.ie/sitemap.xml
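
A client could read the crawl-delay and sitemap records with Python's standard `urllib.robotparser`, as sketched below. The `ROBOTS_TXT` text is a hypothetical reconstruction based on the fields in this report, not the file as served, and `ExampleBot` is a made-up user agent. Note that `urllib.robotparser` uses plain prefix matching for paths, so it is used here only for these two fields and not for the wildcard rules.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical reconstruction of the relevant lines; the real file is not shown here.
ROBOTS_TXT = """\
User-agent: *
Disallow: /wp-admin/
Crawl-delay: 10

Sitemap: https://perfectclean.ie/sitemap.xml
"""

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())

# Seconds to wait between requests for this user agent (falls back to the "*" group).
delay = parser.crawl_delay("ExampleBot")

# List of sitemap URLs declared in the file (available in Python 3.8+).
sitemaps = parser.site_maps()
```

A polite crawler would then sleep for `delay` seconds between successive requests to the site.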

Warnings

  • 2 invalid lines.