irmagazineasia.com
robots.txt

Robots Exclusion Standard data for irmagazineasia.com

Resource Scan

Scan Details

Site Domain irmagazineasia.com
Base Domain irmagazineasia.com
Scan Status Ok
Last Scan2024-09-17T22:18:23+00:00
Next Scan 2024-09-24T22:18:23+00:00

Last Scan

Scanned2024-09-17T22:18:23+00:00
URL https://irmagazineasia.com/robots.txt
Domain IPs 104.21.68.166, 172.67.196.229, 2606:4700:3033::ac43:c4e5, 2606:4700:3034::6815:44a6
Response IP 172.67.196.229
Found Yes
Hash c524a58117fc57346a5da4db76b666334296aba24c8c9e2b7ea5887b43878773
SimHash 694b59110a15

Groups

*

Rule Path
Disallow /cgi-bin/
Disallow /wp-admin/
Disallow /wp-includes/
Allow /wp-admin/admin-ajax.php

dotbot

Rule Path
Disallow /

mauibot

Rule Path
Disallow /

msnbot

Rule Path
Disallow /

slurp

Rule Path
Disallow /

yahoo-mmcrawler

Rule Path
Disallow /

psbot

Rule Path
Disallow /

yahoo-blogs/v3.9

Rule Path
Disallow /

nbot

Rule Path
Disallow /

apache-httpclient

Rule Path
Disallow /

unknown robot identified by bot\*

Rule Path
Disallow /

robot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://irmagazineasia.com/sitemap_index.xml