themalaysialife.com
robots.txt

Robots Exclusion Standard data for themalaysialife.com

Resource Scan

Scan Details

Site Domain themalaysialife.com
Base Domain themalaysialife.com
Scan Status Ok
Last Scan2024-09-20T03:03:42+00:00
Next Scan 2024-09-27T03:03:42+00:00

Last Scan

Scanned2024-09-20T03:03:42+00:00
URL https://themalaysialife.com/robots.txt
Domain IPs 104.26.6.13, 104.26.7.13, 172.67.72.246, 2606:4700:20::681a:60d, 2606:4700:20::681a:70d, 2606:4700:20::ac43:48f6
Response IP 104.26.6.13
Found Yes
Hash 7fb2b3a7460a071a710253ea22b978d8c5488f7da3f15ef86197061d67aa18dd
SimHash 5815d9548113

Groups

*

Rule Path
Disallow /?s=
Disallow /page/*/?s=
Disallow /search/
Disallow /wp-json/
Disallow /?rest_route=

ccbot

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

oai-searchbot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.themalaysialife.com/sitemap_index.xml