nawalakarsa.id
robots.txt

Robots Exclusion Standard data for nawalakarsa.id

Resource Scan

Scan Details

Site Domain nawalakarsa.id
Base Domain nawalakarsa.id
Scan Status Ok
Last Scan2024-10-05T10:51:43+00:00
Next Scan 2024-10-12T10:51:43+00:00

Last Scan

Scanned2024-10-05T10:51:43+00:00
URL https://nawalakarsa.id/robots.txt
Domain IPs 104.21.60.244, 172.67.202.201, 2606:4700:3030::ac43:cac9, 2606:4700:3033::6815:3cf4
Response IP 104.21.60.244
Found Yes
Hash a6c37b4715300fb1dd5219fa16bdaed0b42dfeb886d24bfeaae97babde95fc7a
SimHash 087ec8c0a113

Groups

*

Rule Path
Disallow /?s=
Disallow /page/*/?s=
Disallow /search/
Disallow /wp-json/
Disallow /?rest_route=

ccbot

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://nawalakarsa.id/sitemap_index.xml

Comments

  • START YOAST BLOCK
  • ---------------------------
  • ---------------------------
  • END YOAST BLOCK