rakyat.news
robots.txt

Robots Exclusion Standard data for rakyat.news

Resource Scan

Scan Details

Site Domain rakyat.news
Base Domain rakyat.news
Scan Status Ok
Last Scan2024-09-27T22:42:26+00:00
Next Scan 2024-10-04T22:42:26+00:00

Last Scan

Scanned2024-09-27T22:42:26+00:00
URL https://rakyat.news/robots.txt
Domain IPs 82.197.71.2
Response IP 82.197.71.2
Found Yes
Hash 5fa1e219d42f18a810effcc4b6d51cac5feb83e6381ea54944dde2b80d200471
SimHash 69155041ce71

Groups

*

Rule Path
Allow /
Disallow /cdn-cgi/
Disallow /cgi-bin/
Disallow /wp-includes/
Disallow /wp-admin/
Disallow /page/*
Disallow /?nonamp=*

bingbot

Rule Path
Allow /
Disallow /cdn-cgi/
Disallow /cgi-bin/
Disallow /wp-includes/
Disallow /wp-admin/
Disallow /page/*
Disallow /?nonamp=*

msnbot

Rule Path
Allow /
Disallow /cdn-cgi/
Disallow /cgi-bin/
Disallow /wp-includes/
Disallow /wp-admin/
Disallow /page/*
Disallow /?nonamp=*

msnbot-media

Rule Path
Allow /

Other Records

Field Value
sitemap https://rakyat.news/feed
sitemap https://rakyat.news/sitemap.xml
sitemap https://rakyat.news/gnews.xml