grokent.com
robots.txt

Robots Exclusion Standard data for grokent.com

Resource Scan

Scan Details

Site Domain grokent.com
Base Domain grokent.com
Scan Status Ok
Last Scan2025-05-11T15:15:48+00:00
Next Scan 2025-05-18T15:15:48+00:00

Last Scan

Scanned2025-05-11T15:15:48+00:00
URL https://grokent.com/robots.txt
Domain IPs 104.21.96.3, 172.67.150.25, 2606:4700:3032::6815:6003, 2606:4700:3032::ac43:9619
Response IP 172.67.150.25
Found Yes
Hash 1692a1906f445882818e1e6df4bdff04ae1ca65e03e0767b7e50db4417723cda
SimHash ee18585b6b98

Groups

*

Rule Path
Allow /$
Allow /product/
Allow /shop/
Allow /product-category/
Allow /product-tag/
Allow /tag/
Allow /brand/
Allow /blog/
Allow /category/
Allow /about-us/
Allow /contact-us/
Allow /terms-and-conditions/
Allow /*?filter%2F
Disallow /wp-admin/
Disallow /wp-includes/
Disallow /xmlrpc.php
Disallow /wp-login.php
Disallow /wp-register.php
Disallow /wp-content/plugins/
Disallow /wp-content/themes/
Disallow /wp-content/cache/
Disallow /readme.html
Disallow /license.txt
Disallow /search/
Disallow /comments/
Disallow /trackback/
Disallow /feed/
Disallow /embed/
Disallow /page/
Disallow /author/
Disallow /*?orderby=
Disallow /*?order=

Other Records

Field Value
sitemap https://grokent.com/sitemap.xml

Comments

  • Allowing access to main sections of the site
  • Disallowing access to admin, includes, and sensitive files
  • Blocking unnecessary pages and search results
  • Disallow parameters that might create duplicate content
  • Sitemap location for better indexing