balita.mb.com.ph
robots.txt

Robots Exclusion Standard data for balita.mb.com.ph

Resource Scan

Scan Details

Site Domain balita.mb.com.ph
Base Domain mb.com.ph
Scan Status Ok
Last Scan2024-11-03T07:41:10+00:00
Next Scan 2024-12-03T07:41:10+00:00

Last Scan

Scanned2024-11-03T07:41:10+00:00
URL https://balita.mb.com.ph/robots.txt
Domain IPs 104.22.50.163, 104.22.51.163, 172.67.30.39, 2606:4700:10::6816:32a3, 2606:4700:10::6816:33a3, 2606:4700:10::ac43:1e27
Response IP 104.22.51.163
Found Yes
Hash a3cb06f93a0e4eaf468d222edf29c7a0fe6f54f34dc6347432641bb040eb07eb
SimHash 88ad5e5769b2

Groups

*

Rule Path
Disallow /ajax/*
Disallow /print*
Disallow /getRelatedArticles*
Disallow /getMostReadArticles*
Disallow /article_count/*
Disallow /get-menu-header*
Disallow /article.php*
Disallow /login-mgt
Disallow /*.php
Disallow /widget/*
Disallow /?page=
Disallow /*search?q=
Disallow /*search?query=
Disallow /*search?
Disallow /*seaRch
Disallow /*seaRch?q
Disallow /*seaRch?query

Other Records

Field Value
sitemap https://balita.mb.com.ph/sitemaps/sitemap_0.xml