globeboss.com
robots.txt

Robots Exclusion Standard data for globeboss.com

Resource Scan

Scan Details

Site Domain globeboss.com
Base Domain globeboss.com
Scan Status Failed
Failure Stage Fetching resource.
Failure Reason Server returned a client error.
Last Scan 2025-10-09T09:03:57+00:00
Next Scan 2026-01-07T09:03:57+00:00
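
The report does not state a rescan policy, but the Last Scan and Next Scan timestamps are exactly 90 days apart. A quick check in Python, using only the two values shown above:

```python
from datetime import datetime

# Timestamps copied verbatim from the Scan Details fields.
last_scan = datetime.fromisoformat("2025-10-09T09:03:57+00:00")
next_scan = datetime.fromisoformat("2026-01-07T09:03:57+00:00")

# Difference between the two scans as a timedelta.
interval = next_scan - last_scan
print(interval.days)  # 90
```

Whether 90 days is a fixed interval or a back-off applied after a failed scan is not something this report indicates.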

Last Successful Scan

Scanned 2025-06-12T02:29:59+00:00
URL https://globeboss.com/robots.txt
Redirect https://www.globeboss.com/robots.txt
Redirect Domain www.globeboss.com
Redirect Base globeboss.com
Domain IPs 51.89.185.221
Redirect IPs 51.89.185.221
Response IP 51.89.185.221
Found Yes
Hash b8b8185c1ed9f9d075b276b2ca73cfe31a0c20cc401b96a337aa8dd820e72117
SimHash 79045a7bcaa1
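
The Hash value is 64 hexadecimal digits, which is consistent with SHA-256, although the report names neither the algorithm nor the exact bytes that were hashed. Under the assumption that it is SHA-256 over the raw response body, a fetched copy could be compared against the report roughly as follows (`body_digest` and `matches_report` are hypothetical helper names):

```python
import hashlib

# Hash field copied from the Last Successful Scan block.
REPORTED_HASH = "b8b8185c1ed9f9d075b276b2ca73cfe31a0c20cc401b96a337aa8dd820e72117"

def body_digest(body: bytes) -> str:
    """Return the SHA-256 hex digest of a robots.txt body (assumed algorithm)."""
    return hashlib.sha256(body).hexdigest()

def matches_report(body: bytes, reported: str = REPORTED_HASH) -> bool:
    """True if the body hashes to the value recorded in the report."""
    return body_digest(body) == reported.lower()
```

A mismatch would not prove the file changed; it could equally mean the scanner hashes normalized text or uses a different 256-bit algorithm.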

Groups

*

Rule Path Comment
Allow / -
Disallow /admin/ -
Disallow /wp-admin/ -
Disallow /wp-login.php -
Disallow /wp-includes/ -
Disallow /search/ -
Disallow /cart/ -
Disallow /checkout/ -
Disallow /my-account/ -
Disallow /private/ -
Disallow /tmp/ -
Disallow /cgi-bin/ -
Disallow /*?* -
Disallow /*.php$ -
Disallow /*.js$ -
Disallow /*.css$ -
Disallow /*.pdf$ Exclude PDFs unless they're important for SEO
Disallow /*?utm_ -
Disallow /*?ref= -
Disallow /*?sessionid= -
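
To see how these rules interact, here is a minimal sketch of the longest-match evaluation described in RFC 9309, with `*` treated as a wildcard and a trailing `$` as an end anchor. The `RULES` list is transcribed from the table above; `rule_matches` and `is_allowed` are hypothetical helpers, and real crawlers may differ in details such as percent-encoding.

```python
import re

# (rule, path) pairs transcribed from the "*" group above.
RULES = [
    ("allow", "/"), ("disallow", "/admin/"), ("disallow", "/wp-admin/"),
    ("disallow", "/wp-login.php"), ("disallow", "/wp-includes/"),
    ("disallow", "/search/"), ("disallow", "/cart/"), ("disallow", "/checkout/"),
    ("disallow", "/my-account/"), ("disallow", "/private/"), ("disallow", "/tmp/"),
    ("disallow", "/cgi-bin/"), ("disallow", "/*?*"), ("disallow", "/*.php$"),
    ("disallow", "/*.js$"), ("disallow", "/*.css$"), ("disallow", "/*.pdf$"),
    ("disallow", "/*?utm_"), ("disallow", "/*?ref="), ("disallow", "/*?sessionid="),
]

def rule_matches(pattern: str, url_path: str) -> bool:
    """Match a robots.txt path pattern against a URL path (plus query string)."""
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape literal pieces and let each '*' match any run of characters.
    regex = ".*".join(re.escape(piece) for piece in pattern.split("*"))
    if anchored:
        return re.fullmatch(regex, url_path) is not None
    return re.match(regex, url_path) is not None  # prefix match

def is_allowed(url_path: str, rules=RULES) -> bool:
    """Longest matching pattern wins; on a tie, allow beats disallow."""
    best_len, best_allow = -1, True  # no matching rule means allowed
    for kind, pattern in rules:
        if not rule_matches(pattern, url_path):
            continue
        allow = kind == "allow"
        if len(pattern) > best_len or (len(pattern) == best_len and allow):
            best_len, best_allow = len(pattern), allow
    return best_allow

print(is_allowed("/products/widget"))               # True
print(is_allowed("/products/widget?utm_source=x"))  # False
print(is_allowed("/docs/manual.pdf"))               # False
```

Under this model, `Allow /` is the shortest pattern and so never overrides a matching Disallow. `Disallow /*?*` already matches every URL that contains a query string, which makes the `?utm_`, `?ref=` and `?sessionid=` rules redundant.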

Other Records

Field Value
sitemap https://www.globeboss.com/sitemaps.xml

Comments

  • Block duplicate content from session IDs or tracking parameters
  • Sitemap location