harian.disway.id
robots.txt

Robots Exclusion Standard data for harian.disway.id

Resource Scan

Scan Details

Site Domain harian.disway.id
Base Domain disway.id
Scan Status Ok
Last Scan2024-05-12T18:52:59+00:00
Next Scan 2024-06-11T18:52:59+00:00

Last Scan

Scanned2024-05-12T18:52:59+00:00
URL https://harian.disway.id/robots.txt
Domain IPs 104.26.14.37, 104.26.15.37, 172.67.75.73, 2606:4700:20::681a:e25, 2606:4700:20::681a:f25, 2606:4700:20::ac43:4b49
Response IP 104.26.14.37
Found Yes
Hash 7378f9a9754f978173f3a1504bf8c80abaed12b72f5aae753493b9d3261da407
SimHash c94918556735

Groups

*
googlebot

Rule Path
Allow /

Other Records

Field Value
sitemap https://harian.disway.id/sitemap.xml
sitemap https://harian.disway.id/sitemap/index.xml
sitemap https://harian.disway.id/sitemap/image.xml
sitemap https://harian.disway.id/sitemap/tag.xml
sitemap https://harian.disway.id/frontend/sitemapgoogle
sitemap https://harian.disway.id/frontend/sitemap