hawwae.com
robots.txt

Robots Exclusion Standard data for hawwae.com

Resource Scan

Scan Details

Site Domain hawwae.com
Base Domain hawwae.com
Scan Status Failed
Failure StageFetching resource.
Failure ReasonCouldn't connect to server.
Last Scan2024-09-05T17:21:59+00:00
Next Scan 2024-12-04T17:21:59+00:00

Last Successful Scan

Scanned2023-12-26T16:38:10+00:00
URL https://hawwae.com/robots.txt
Domain IPs 52.10.27.176
Response IP 52.10.27.176
Found Yes
Hash b7359f1e8f9dd0cc2a0d0736280fc113b97c2ea5fbdbafb172bcd000fb260ad6
SimHash 6514de127154

Groups

*

Rule Path
Disallow /admin
Disallow /admin-2
Disallow /login
Disallow /cart
Disallow /checkouts
Disallow /orders
Disallow /my-account
Disallow /search
Disallow /policies
Disallow /*theme_preview_id*
Disallow /checkout-additional
Disallow /password

adsbot-google

Rule Path
Disallow /cart
Disallow /checkouts
Disallow /orders
Disallow /*theme_preview_id*
Disallow /checkout-additional

nutch

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /admin
Disallow /login
Disallow /cart
Disallow /checkouts
Disallow /orders
Disallow /my-account
Disallow /search
Disallow /policies
Disallow /*theme_preview_id*
Disallow /checkout-additional

Other Records

Field Value
crawl-delay 10

ahrefssiteaudit

Rule Path
Disallow /admin
Disallow /login
Disallow /cart
Disallow /checkouts
Disallow /orders
Disallow /my-account
Disallow /search
Disallow /policies
Disallow /*theme_preview_id*
Disallow /checkout-additional

Other Records

Field Value
crawl-delay 10

mj12bot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 10

pinterest

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 1

googlebot

Rule Path
Disallow

googlebot-image

Rule Path
Disallow

Other Records

Field Value Comment
sitemap https://hawwae.com/sitemap.xml -
sitemap https://hawwae.com/sitemap.xml This will automatically remain your current primary domain to assure correct indexing.
sitemap https://hawwae.com/sitemap.xml This will automatically remain your current primary domain to assure correct indexing.

Comments

  • Google adsbot ignores robots.txt unless specifically named!