canseo.ir
robots.txt

Robots Exclusion Standard data for canseo.ir

Resource Scan

Scan Details

Site Domain canseo.ir
Base Domain canseo.ir
Scan Status Failed
Failure Reason Scan timed out.
Last Scan 2024-04-22T16:53:29+00:00
Next Scan 2024-06-21T16:53:29+00:00

Last Successful Scan

Scanned 2024-01-31T16:09:12+00:00
URL https://canseo.ir/robots.txt
Domain IPs 217.144.107.107
Response IP 217.144.107.107
Found Yes
Hash 5a501db4ef139da6c8fd7e3f6890264ed8be4b8757a2aa8c8deed6b6675362f5
SimHash 40005940ad13
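
The Hash value above is 64 hexadecimal characters long, which matches the length of a SHA-256 digest. The report does not name the algorithm or say whether the body is normalized before hashing, so the following is only a minimal sketch, assuming SHA-256 over the raw response bytes. The function names content_hash and has_changed are hypothetical.

```python
import hashlib


def content_hash(body: bytes) -> str:
    """Return the hex SHA-256 digest of a fetched robots.txt body.

    Assumption: the scanner hashes the raw bytes; this is not confirmed
    by the report.
    """
    return hashlib.sha256(body).hexdigest()


def has_changed(body: bytes, previous_hash: str) -> bool:
    """Compare a freshly fetched body against a previously stored digest."""
    return content_hash(body) != previous_hash.lower()
```

A digest comparison like this only detects that the file changed at all; a fuzzy fingerprint such as the SimHash field is the kind of value used to judge how similar two versions are.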

Groups

*

Rule Path
Disallow /wp-admin/
Allow /wp-admin/admin-ajax.php

ccbot

Rule Path
Disallow /

claude-web

Rule Path
Disallow /

anthropic-ai

Rule Path
Disallow /
Disallow /

isec_bot

Rule Path
Disallow /

turnitinbot

Rule Path
Disallow /

icc-crawler

Rule Path
Disallow /

trendkite-akashic-crawler

Rule Path
Disallow /

tineye-bot

Rule Path
Disallow /

r6_commentreader

Rule Path
Disallow /

blexbot

Rule Path
Disallow /

nutch

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

google-extended

Rule Path
Disallow /
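
The groups above can be read as follows: a crawler uses the group that names it, and falls back to the * group otherwise; within a group, the longest matching path wins, with Allow winning a tie. The sketch below illustrates that evaluation in the style of RFC 9309. It is a simplified illustration, not the scanner's own logic: it covers only plain path prefixes (no * or $ wildcards), matches agent names exactly rather than by product token, and includes only three of the listed groups. The names GROUPS and is_allowed are hypothetical.

```python
# A subset of the groups from the report, keyed by lowercase user agent.
GROUPS = {
    "*": [("disallow", "/wp-admin/"), ("allow", "/wp-admin/admin-ajax.php")],
    "gptbot": [("disallow", "/")],
    "ccbot": [("disallow", "/")],
}


def is_allowed(agent: str, path: str) -> bool:
    """Return True if `agent` may fetch `path` under the rules in GROUPS."""
    # Use the agent's own group if one exists, otherwise the wildcard group.
    rules = GROUPS.get(agent.lower(), GROUPS["*"])
    best_len, best_allow = -1, True  # no matching rule means allowed
    for kind, prefix in rules:
        if path.startswith(prefix):
            allow = kind == "allow"
            # Longest prefix wins; on equal length, allow wins.
            if len(prefix) > best_len or (len(prefix) == best_len and allow):
                best_len, best_allow = len(prefix), allow
    return best_allow


print(is_allowed("Googlebot", "/wp-admin/"))                # False
print(is_allowed("Googlebot", "/wp-admin/admin-ajax.php"))  # True
print(is_allowed("GPTBot", "/"))                            # False
```

Longest-match evaluation is used here because the Allow rule for admin-ajax.php is listed after the broader Disallow rule; a parser that stops at the first matching rule would report that path as blocked.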

Other Records

Field Value
sitemap https://canseo.ir/sitemap_index.xml

Warnings

  • `‍user-agent` is not a known field.
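
Since user-agent is a standard robots.txt field, one plausible cause of this warning is an invisible character (for example a zero-width joiner or a byte order mark) attached to the field name in the file. That is an assumption; the report does not state the cause. A minimal sketch for spotting such characters follows, with find_invisible as a hypothetical helper name.

```python
import unicodedata


def find_invisible(field_name: str) -> list[tuple[int, str]]:
    """List (index, code point) for every Unicode format character (Cf).

    Category Cf covers zero-width joiners, zero-width spaces, and the
    byte order mark, none of which are visible in most editors.
    """
    return [
        (i, f"U+{ord(ch):04X}")
        for i, ch in enumerate(field_name)
        if unicodedata.category(ch) == "Cf"
    ]


print(find_invisible("\u200duser-agent"))  # [(0, 'U+200D')]
print(find_invisible("user-agent"))        # []
```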