eireprints.com
robots.txt

Robots Exclusion Standard data for eireprints.com

Resource Scan

Scan Details

Site Domain eireprints.com
Base Domain eireprints.com
Scan Status Ok
Last Scan2024-09-26T16:38:59+00:00
Next Scan 2024-10-26T16:38:59+00:00

Last Scan

Scanned2024-09-26T16:38:59+00:00
URL http://eireprints.com/robots.txt
Redirect https://hubirish.com/robots.txt
Redirect Domain hubirish.com
Redirect Base hubirish.com
Domain IPs 46.166.189.98
Redirect IPs 104.21.79.60, 172.67.142.138, 2606:4700:3030::ac43:8e8a, 2606:4700:3031::6815:4f3c
Response IP 172.67.142.138
Found Yes
Hash e1002019676f966b8f7bb15d332a971cb88f60ca9c87bcae7d960bd7f6534551
SimHash 6f6469236fda

Groups

*

Rule Path Comment
Disallow /wp-admin/ -
Allow /wp-admin/admin-ajax.php -
Disallow /wp-includes/ -
Disallow /wp-content/plugins/ -
Disallow /wp-content/cache/ -
Disallow /wp-content/themes/ -
Disallow /cgi-bin/ -
Disallow /*.php$ -
Disallow /*.cgi$ -
Disallow /*.svn$ -
Disallow /*.git$ -
Disallow /*.env$ Environment configuration files
Disallow /*?currency=* Currency parameter URLs
Disallow /*?s= Search results pages
Disallow /*?*session_id= Session ID-based URLs
Disallow /*?add-to-cart= WooCommerce add-to-cart links
Disallow /*?orderby= Sorting options
Disallow /*?filter_* Filter parameters
Disallow /product-tag/*?currency=* All product tags with currency parameters
Disallow /product-category/*?currency=* All product categories with currency parameters
Disallow /product-category/*/page/*?currency=* Paginated product categories with currency parameters
Disallow /category/ -
Disallow /tag/ -
Disallow /page/*/ Pagination pages
Disallow /search/ -
Disallow /wp-login.php -
Disallow /wp-register.php -
Disallow /wp-signup.php -
Disallow /author/ -

googlebot

Rule Path
Allow /*.js$
Allow /*.css$
Allow /wp-content/uploads/
Allow /wp-content/themes/*/*.css
Allow /wp-content/themes/*/*.js

bingbot

Rule Path
Allow /*.js$
Allow /*.css$
Allow /wp-content/uploads/
Allow /wp-content/themes/*/*.css
Allow /wp-content/themes/*/*.js

yandex

Rule Path
Allow /*.js$
Allow /*.css$
Allow /wp-content/uploads/
Allow /wp-content/themes/*/*.css
Allow /wp-content/themes/*/*.js
Disallow /privacy-policy/
Disallow /terms-and-conditions/
Disallow /tmp/
Disallow /backup/
Disallow /old/
Disallow /test/
Disallow /scripts/
Disallow /tools/
Disallow /*.log$
Disallow /*.json$
Disallow /*?utm_*
Disallow /*?fbclid=*

Other Records

Field Value
sitemap https://hubirish.com/sitemap_index.xml

Comments

  • General settings
  • Disallow access to sensitive directories
  • Disallow access to specific file types (commonly used by WordPress)
  • Block all bots from crawling URLs with currency parameters to avoid duplicate content
  • Block common WooCommerce query parameters that lead to duplicate or unnecessary pages
  • Disallow crawling of product category and tag pages with pagination and currency parameters
  • Block common duplicate content pages
  • Block access to internal search results pages
  • Block access to login, registration, and admin pages
  • Block author archive pages if not in use
  • Allow Googlebot to index certain scripts and styles
  • Allow specific bots (e.g., Bingbot, Yandex, etc.) access to certain files
  • Additional security measures
  • Prevent indexing of sensitive data (GDPR compliance)
  • Prevent indexing of temporary and duplicate content
  • Optimize crawl budget by blocking unnecessary scripts and tools
  • Optimize crawl budget by blocking bots from loading non-essential or duplicate resources
  • Sitemap location

Warnings

  • 1 invalid line.