james5za.com
robots.txt

Robots Exclusion Standard data for james5za.com

Resource Scan

Scan Details

Site Domain james5za.com
Base Domain james5za.com
Scan Status Failed
Failure StageFetching resource.
Failure ReasonCouldn't connect to server.
Last Scan2025-07-06T03:25:55+00:00
Next Scan 2025-10-04T03:25:55+00:00

Last Successful Scan

Scanned2023-07-01T04:17:30+00:00
URL https://james5za.com/robots.txt
Domain IPs 52.10.27.176
Response IP 52.10.27.176
Found Yes
Hash ac20fc7f2de81a98d9bdc7d38f701d7997c4dfff8656829c346dd08fac22a8c5
SimHash 6514de127354

Groups

*

Rule Path
Disallow /admin
Disallow /admin-2
Disallow /login
Disallow /cart
Disallow /checkouts
Disallow /orders
Disallow /my-account
Disallow /search
Disallow /policies
Disallow /*theme_preview_id*
Disallow /checkout-additional
Disallow /password

adsbot-google

Rule Path
Disallow /cart
Disallow /checkouts
Disallow /orders
Disallow /*theme_preview_id*
Disallow /checkout-additional

nutch

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /admin
Disallow /login
Disallow /cart
Disallow /checkouts
Disallow /orders
Disallow /my-account
Disallow /search
Disallow /policies
Disallow /*theme_preview_id*
Disallow /checkout-additional

Other Records

Field Value
crawl-delay 10

ahrefssiteaudit

Rule Path
Disallow /admin
Disallow /login
Disallow /cart
Disallow /checkouts
Disallow /orders
Disallow /my-account
Disallow /search
Disallow /policies
Disallow /*theme_preview_id*
Disallow /checkout-additional

Other Records

Field Value
crawl-delay 10

mj12bot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 10

pinterest

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 1

googlebot

Rule Path
Disallow

googlebot-image

Rule Path
Disallow

Other Records

Field Value Comment
sitemap https://james5za.com/sitemap.xml -
sitemap https://james5za.com/sitemap.xml This will automatically remain your current primary domain to assure correct indexing.
sitemap https://james5za.com/sitemap.xml This will automatically remain your current primary domain to assure correct indexing.

Comments

  • Google adsbot ignores robots.txt unless specifically named!