duniaku.net
robots.txt
Robots Exclusion Standard data for duniaku.net
Resource Scan
Scan Details
Site Domain | duniaku.net |
Base Domain | duniaku.net |
Scan Status | Ok |
Last Scan | 2024-05-23T04:15:44+00:00 |
Next Scan | 2024-06-22T04:15:44+00:00 |
Last Scan
Scanned | 2024-05-23T04:15:44+00:00 |
URL | https://duniaku.net/robots.txt |
Redirect | https://duniaku.idntimes.com/robots.txt |
Redirect Domain | duniaku.idntimes.com |
Redirect Base | idntimes.com |
Domain IPs | 104.21.92.190, 172.67.197.33, 2606:4700:3034::6815:5cbe, 2606:4700:3036::ac43:c521 |
Redirect IPs | 13.225.4.5, 13.225.4.55, 13.225.4.9, 13.225.4.94 |
Response IP | 13.225.4.94 |
Found | Yes |
Hash | 855cb8cd94937ed06c50b6e698431017da103288177b466f3631b442d1784d77 |
SimHash | 20006a6ad3f0 |
Groups
*
Rule | Path |
---|---|
Disallow | */ajax/* |
Disallow | /https%3A//twitter.com/ |
Disallow | /253109699/ |
Disallow | /https%3A//www.linkedin.com/ |
Disallow | *?utm_source= |
Disallow | *.ttf |
Disallow | /search* |
Other Records
Field | Value |
---|---|
sitemap | https://duniaku.idntimes.com/sitemap.xml |
Warnings
- 2 invalid lines.