horrah.com
robots.txt

Robots Exclusion Standard data for horrah.com

Resource Scan

Scan Details

Site Domain horrah.com
Base Domain horrah.com
Scan Status Ok
Last Scan2024-11-12T00:21:00+00:00
Next Scan 2024-11-19T00:21:00+00:00

Last Scan

Scanned2024-11-12T00:21:00+00:00
URL https://horrah.com/robots.txt
Domain IPs 104.21.35.45, 172.67.213.152, 2606:4700:3031::6815:232d, 2606:4700:3032::ac43:d598
Response IP 104.21.35.45
Found Yes
Hash d00aed7dc0cfb34a8a8f81b2621740adec169e0b337e0e2f665e7e6aacf84159
SimHash 494874458665

Groups

*

Rule Path
Disallow /wp-admin
Disallow /wp-includes
Disallow /wp-login.php
Disallow /wp-content/plugins
Disallow /wp-content/cache
Disallow /wp-content/themes
Disallow /trackback
Disallow */trackback
Disallow */feed
Disallow /?
Disallow /tags
Disallow /*?parameter=
Disallow /cgi-bin
Disallow /*.php$
Disallow /*.inc$
Disallow /*.gz$

*

Rule Path
Disallow /wp-content/uploads/wp-import-export-lite/

Other Records

Field Value
sitemap https://horrah.com/sitemap.xml

Comments

  • WP Import Export Rule

Warnings

  • 2 invalid lines.