duniaku.net
robots.txt

Robots Exclusion Standard data for duniaku.net

Resource Scan

Scan Details

Site Domain duniaku.net
Base Domain duniaku.net
Scan Status Ok
Last Scan2024-05-23T04:15:44+00:00
Next Scan 2024-06-22T04:15:44+00:00

Last Scan

Scanned2024-05-23T04:15:44+00:00
URL https://duniaku.net/robots.txt
Redirect https://duniaku.idntimes.com/robots.txt
Redirect Domain duniaku.idntimes.com
Redirect Base idntimes.com
Domain IPs 104.21.92.190, 172.67.197.33, 2606:4700:3034::6815:5cbe, 2606:4700:3036::ac43:c521
Redirect IPs 13.225.4.5, 13.225.4.55, 13.225.4.9, 13.225.4.94
Response IP 13.225.4.94
Found Yes
Hash 855cb8cd94937ed06c50b6e698431017da103288177b466f3631b442d1784d77
SimHash 20006a6ad3f0

Groups

twitterbot

Rule Path
Allow *?utm_source=

adsbot-google

Rule Path
Disallow *.ttf

*

Rule Path
Disallow */ajax/*
Disallow /https%3A//twitter.com/
Disallow /253109699/
Disallow /https%3A//www.linkedin.com/
Disallow *?utm_source=
Disallow *.ttf
Disallow /search*

Other Records

Field Value
sitemap https://duniaku.idntimes.com/sitemap.xml

Warnings

  • 2 invalid lines.