infogym.com
robots.txt

Robots Exclusion Standard data for infogym.com

Resource Scan

Scan Details

Site Domain infogym.com
Base Domain infogym.com
Scan Status Ok
Last Scan2024-07-07T19:24:06+00:00
Next Scan 2024-07-14T19:24:06+00:00

Last Scan

Scanned2024-07-07T19:24:06+00:00
URL https://infogym.com/robots.txt
Domain IPs 46.105.204.2
Response IP 46.105.204.2
Found Yes
Hash a74eea79104c6c6e732e79fb2cabf72b432143a8787f0a041b1d0dda2187637c
SimHash 531c9cc2ef98

Groups

*

Rule Path
Allow /wp-admin/admin-ajax.php
Allow /fr/wp-admin/admin-ajax.php
Disallow /search/
Disallow /fr/search/
Disallow /wp-admin/
Disallow /fr/wp-admin/
Disallow /*public_html/
Disallow /fr/*public_html/
Disallow /*index.php?
Disallow /fr/*index.php?

Other Records

Field Value
crawl-delay 3600

semrushbot

Rule Path
Disallow /

semrushbot-ba

Rule Path
Disallow /

semrushbot-si

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

feedburner

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

weborama-fetcher

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.infogym.com/fr/sitemap.xml
sitemap http://www.infogym.com/fr/sitemap_index.xml
sitemap http://www.infogym.com/fr/post-sitemap.xml
sitemap http://www.infogym.com/fr/page-sitemap.xml
sitemap http://www.infogym.com/fr/category-sitemap.xml

Comments

  • Bloquer certains bots malveillants

Warnings

  • `host` is not a known field.