harianhaluan.id
robots.txt

Robots Exclusion Standard data for harianhaluan.id

Resource Scan

Scan Details

Site Domain harianhaluan.id
Base Domain harianhaluan.id
Scan Status Ok
Last Scan2024-11-13T06:44:07+00:00
Next Scan 2024-11-20T06:44:07+00:00

Last Scan

Scanned2024-11-13T06:44:07+00:00
URL https://harianhaluan.id/robots.txt
Domain IPs 104.21.17.102, 172.67.175.116, 2606:4700:3032::6815:1166, 2606:4700:3036::ac43:af74
Response IP 104.21.17.102
Found Yes
Hash 55020007cd4b489bc16a76761084d828b625f4dd264a5e11416ea16ed78ff4c2
SimHash c140c8c0a41f

Groups

*

Rule Path
Allow /?s=
Allow /page/*/?s=
Allow /search/
Allow /wp-json/
Allow /?rest_route=

Other Records

Field Value
sitemap https://harianhaluan.id/sitemap_index.xml

Comments

  • START YOAST BLOCK
  • ---------------------------
  • ---------------------------
  • END YOAST BLOCK