infodiscus.com
robots.txt

Robots Exclusion Standard data for infodiscus.com

Resource Scan

Scan Details

Site Domain infodiscus.com
Base Domain infodiscus.com
Scan Status Ok
Last Scan 2024-11-14T07:03:48+00:00
Next Scan 2024-11-21T07:03:48+00:00

Last Scan

Scanned 2024-11-14T07:03:48+00:00
URL https://infodiscus.com/robots.txt
Domain IPs 46.105.204.2
Response IP 46.105.204.2
Found Yes
Hash 9dc05e4415f6e092b52a5752df8801cfb34bcb3db01f10b1de4d369ef9f2b285
SimHash 011c0ee349b8
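
The Hash value above is 64 hexadecimal characters long, which is consistent with a SHA-256 digest, although the report does not name the algorithm or say exactly which bytes were hashed. The following is a minimal sketch, assuming SHA-256 over the raw response body, of how a freshly fetched copy could be compared against the recorded value. `body_digest` and `matches_recorded` are hypothetical helper names, not part of the scanner.

```python
import hashlib

# Value copied from the "Hash" row of the scan details.
RECORDED_HASH = "9dc05e4415f6e092b52a5752df8801cfb34bcb3db01f10b1de4d369ef9f2b285"


def body_digest(body: bytes) -> str:
    """Return the SHA-256 hex digest of the raw bytes (assumed algorithm)."""
    return hashlib.sha256(body).hexdigest()


def matches_recorded(body: bytes, recorded: str = RECORDED_HASH) -> bool:
    """True if the body hashes to the recorded value under the SHA-256 assumption."""
    return body_digest(body) == recorded.lower()
```

If the digests differ for an unchanged file, the assumption about the algorithm or the hashed bytes (for example, normalized line endings) is probably wrong rather than the file.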

Groups

*

Rule Path
Disallow /search/
Disallow /discuscom/search/
Allow /discuscom/wp-admin/admin-ajax.php
Disallow /discuscom/wp-admin
Disallow /discuscom/*public_html/
Disallow /discuscom/*index.php?

Other Records

Field Value
crawl-delay 30
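
The `*` group mixes an Allow rule with Disallow rules and uses `*` wildcards in two paths. As an illustration of how such rules are commonly evaluated, here is a minimal sketch assuming the longest-match precedence described in RFC 9309 (the longest matching pattern wins, and Allow wins a tie). The rule list is transcribed from the table above; `pattern_to_regex` and `is_allowed` are hypothetical helpers, and individual crawlers may resolve conflicts differently.

```python
import re

# Rules of the "*" group, transcribed from the listing.
RULES = [
    ("disallow", "/search/"),
    ("disallow", "/discuscom/search/"),
    ("allow", "/discuscom/wp-admin/admin-ajax.php"),
    ("disallow", "/discuscom/wp-admin"),
    ("disallow", "/discuscom/*public_html/"),
    ("disallow", "/discuscom/*index.php?"),
]


def pattern_to_regex(pattern: str) -> re.Pattern:
    """Translate a robots.txt path pattern: '*' is a wildcard, trailing '$' anchors the end."""
    anchored = pattern.endswith("$")
    core = pattern[:-1] if anchored else pattern
    body = ".*".join(re.escape(part) for part in core.split("*"))
    return re.compile(body + (r"\Z" if anchored else ""))


def is_allowed(path: str, rules=RULES) -> bool:
    """Apply the longest matching rule; paths matched by no rule are allowed."""
    best_len = -1
    allowed = True
    for kind, pattern in rules:
        if pattern_to_regex(pattern).match(path):
            is_allow = kind == "allow"
            if len(pattern) > best_len or (len(pattern) == best_len and is_allow):
                best_len = len(pattern)
                allowed = is_allow
    return allowed
```

Under these assumptions, `/discuscom/wp-admin/admin-ajax.php` stays fetchable because the Allow pattern is longer than `/discuscom/wp-admin`, while other paths under `/discuscom/wp-admin` remain blocked.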

semrushbot

Rule Path
Disallow /

semrushbot-ba

Rule Path
Disallow /

semrushbot-si

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

feedburner

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

weborama-fetcher

Rule Path
Disallow /
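
The seven named groups above each carry a single `Disallow /`, so those crawlers are blocked from the whole site while other agents fall back to the `*` group. A minimal sketch of that group selection using Python's standard `urllib.robotparser` follows. The robots.txt text is a reconstruction from this listing, not the original file, and the `*` group is reduced to one literal rule plus the crawl delay because `urllib.robotparser` matches paths as plain prefixes and does not interpret `*` wildcards. `ExampleBot` is a made-up agent name.

```python
from urllib.robotparser import RobotFileParser

# Agent names of the groups that disallow everything, taken from the listing.
BLOCKED_BOTS = [
    "semrushbot", "semrushbot-ba", "semrushbot-si",
    "ahrefsbot", "feedburner", "mj12bot", "weborama-fetcher",
]


def build_robots_txt() -> str:
    """Rebuild a simplified robots.txt from the listing (reconstruction, not the original)."""
    lines = ["User-agent: *", "Disallow: /search/", "Crawl-delay: 30", ""]
    for bot in BLOCKED_BOTS:
        lines += [f"User-agent: {bot}", "Disallow: /", ""]
    return "\n".join(lines)


parser = RobotFileParser()
parser.parse(build_robots_txt().splitlines())

# A named bot is matched by its own group; any other agent uses the "*" group.
semrush_ok = parser.can_fetch("SemrushBot", "https://infodiscus.com/discuscom/")
other_ok = parser.can_fetch("ExampleBot", "https://infodiscus.com/discuscom/")
other_delay = parser.crawl_delay("ExampleBot")
```

Here `semrush_ok` is `False`, `other_ok` is `True`, and `other_delay` is `30`.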

Other Records

Field Value
sitemap https://www.infodiscus.com/discuscom/sitemap.xml
sitemap http://www.infodiscus.com/discuscom/sitemap_index.xml
sitemap http://www.infodiscus.com/discuscom/post-sitemap.xml
sitemap http://www.infodiscus.com/discuscom/page-sitemap.xml
sitemap http://www.infodiscus.com/discuscom/category-sitemap.xml
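
The sitemap records above use a mix of `https` and `http` URLs, and all of them point at the `www` host rather than the bare domain that was scanned. A minimal sketch of how these records could be summarized with the standard `urllib.parse` module is shown below. `summarize` and `plain_http` are hypothetical helpers, and whether the mixed schemes matter depends on how the site redirects, which this report does not show.

```python
from urllib.parse import urlsplit

# Sitemap URLs copied from the "Other Records" table.
SITEMAPS = [
    "https://www.infodiscus.com/discuscom/sitemap.xml",
    "http://www.infodiscus.com/discuscom/sitemap_index.xml",
    "http://www.infodiscus.com/discuscom/post-sitemap.xml",
    "http://www.infodiscus.com/discuscom/page-sitemap.xml",
    "http://www.infodiscus.com/discuscom/category-sitemap.xml",
]


def summarize(urls):
    """Collect the distinct schemes and hosts used by the sitemap URLs."""
    parts = [urlsplit(u) for u in urls]
    return {
        "schemes": sorted({p.scheme for p in parts}),
        "hosts": sorted({p.hostname for p in parts}),
    }


def plain_http(urls):
    """Return the sitemap URLs that are not served over https."""
    return [u for u in urls if urlsplit(u).scheme == "http"]
```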

Comments

  • Block certain malicious bots

Warnings

  • `host` is not a known field.