sinarbestari.sinarharian.com.my
robots.txt

Robots Exclusion Standard data for sinarbestari.sinarharian.com.my

Resource Scan

Scan Details

Site Domain sinarbestari.sinarharian.com.my
Base Domain sinarharian.com.my
Scan Status Ok
Last Scan2024-05-18T19:17:56+00:00
Next Scan 2024-06-17T19:17:56+00:00

Last Scan

Scanned2024-05-18T19:17:56+00:00
URL https://sinarbestari.sinarharian.com.my/robots.txt
Domain IPs 104.18.87.98, 104.18.88.98
Response IP 104.18.87.98
Found Yes
Hash d24e6a30ccae7bd5c8bf5053ee5b859e6540926ce501393c880ef01e10084b12
SimHash 91ad70676ff4

Groups

*

Rule Path
Disallow /ajax/*
Disallow /print*
Disallow /getRelatedArticles*
Disallow /getMostReadArticles*
Disallow /article_count/*
Disallow /get-menu-header*
Disallow /search*
Disallow /morearticles/*
Disallow /article.php*
Disallow /login-mgt
Disallow /*.php
Disallow /archive/*
Disallow /widget/*
Disallow */page/*

grapeshot

Rule Path
Disallow