almanilan.com
robots.txt

Robots Exclusion Standard data for almanilan.com

Resource Scan

Scan Details

Site Domain almanilan.com
Base Domain almanilan.com
Scan Status Failed
Failure StageFetching resource.
Failure ReasonServer returned a client error.
Last Scan2025-04-19T02:48:45+00:00
Next Scan 2025-04-26T02:48:45+00:00

Last Successful Scan

Scanned2025-04-11T01:48:05+00:00
URL https://almanilan.com/robots.txt
Domain IPs 104.21.96.141, 172.67.181.238, 2606:4700:3030::ac43:b5ee, 2606:4700:3034::6815:608d
Response IP 104.21.96.141
Found Yes
Hash 85891711d86ffcd63e589c979b1fad4ac565dde0608c64ca08c324ba115efbb2
SimHash eb0690326e13

Groups

*

Rule Path
Allow /wp-admin/admin-ajax.php
Disallow /wp-admin/
Disallow /wp-includes/
Disallow /author
Disallow /?attachment_id=*
Disallow /?replytocom=*
Disallow /*/?redirect_to=*
Disallow /?s=

chatgpt

Rule Path
Disallow /

openai

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

sirdatabot

Rule Path
Disallow /

diffbot

Rule Path
Disallow /

seznambot

Rule Path
Disallow /

grapeshot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.almanilan.com/sitemap_index.xml