sapikotak.id
robots.txt

Robots Exclusion Standard data for sapikotak.id

Resource Scan

Scan Details

Site Domain sapikotak.id
Base Domain sapikotak.id
Scan Status Ok
Last Scan 2025-10-25T08:44:41+00:00
Next Scan 2025-11-01T08:44:41+00:00

Last Scan

Scanned 2025-10-25T08:44:41+00:00
URL https://sapikotak.id/robots.txt
Domain IPs 104.21.34.103, 172.67.159.24, 2606:4700:3030::6815:2267, 2606:4700:3032::ac43:9f18
Response IP 104.21.34.103
Found Yes
Hash a682fb74dbd244afb41ab30e8c6e706ede11561a42502a188269c8944ea7c242
SimHash 6819650bd7b1

Groups

*

Rule Path
Disallow /cgi-bin/
Disallow /z/j/
Disallow /z/c/
Disallow /stats/
Disallow /dh_
Disallow /about/
Disallow /contact/
Disallow /tag/
Disallow /wp-admin/
Disallow /wp-includes/
Disallow /contact
Disallow /manual
Disallow /manual/*
Disallow /phpmanual/
Disallow /category/
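
Rules without wildcards are matched as path prefixes, so /contact already covers /contact/, and /dh_ covers any path that begins with those characters. A minimal Python sketch of that prefix check, assuming standard Robots Exclusion Protocol matching; the names STAR_DISALLOW and blocked_for_all are illustrative and not part of the scan data:

```python
# Disallow paths copied from the "*" group above. The trailing "*" in
# "/manual/*" is dropped because a prefix match already implies it.
STAR_DISALLOW = [
    "/cgi-bin/", "/z/j/", "/z/c/", "/stats/", "/dh_", "/about/",
    "/contact/", "/tag/", "/wp-admin/", "/wp-includes/", "/contact",
    "/manual", "/manual/", "/phpmanual/", "/category/",
]


def blocked_for_all(path: str) -> bool:
    """Return True if the path starts with any Disallow prefix (hypothetical helper)."""
    return any(path.startswith(prefix) for prefix in STAR_DISALLOW)
```

Under this reading, /contact-us and /dh_stats are blocked, while /about (without the trailing slash) is not, because the rule is /about/.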

googlebot

Rule Path
Disallow /*.php$
Disallow /*.js$
Disallow /*.inc$
Disallow /*.css$
Disallow /*.gz$
Disallow /*.wmv$
Disallow /*.cgi$
Disallow /*.xhtml$
Disallow /*?*
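
In the googlebot group, * stands for any run of characters and a trailing $ anchors the match at the end of the URL, following the wildcard conventions described in RFC 9309. The sketch below translates those patterns into regular expressions to show which URLs they would cover. It is an illustration under those assumptions, not the crawler's actual implementation, and the function names are hypothetical:

```python
import re

# Disallow patterns copied from the googlebot group above.
GOOGLEBOT_DISALLOW = [
    "/*.php$", "/*.js$", "/*.inc$", "/*.css$", "/*.gz$",
    "/*.wmv$", "/*.cgi$", "/*.xhtml$", "/*?*",
]


def robots_pattern_to_regex(pattern: str) -> re.Pattern:
    """Translate a robots.txt path pattern into a compiled regex (hypothetical helper)."""
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape the literal pieces, then join them with ".*" for each "*".
    body = ".*".join(re.escape(piece) for piece in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def blocked_for_googlebot(path_and_query: str) -> bool:
    """Return True if any Disallow pattern matches from the start of the URL path."""
    return any(
        robots_pattern_to_regex(p).match(path_and_query)
        for p in GOOGLEBOT_DISALLOW
    )
```

The input is the path plus query string, so /index.php and /post/?p=12 both match, while /blog/post/ does not.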

duggmirror

Rule Path
Disallow /

googlebot-image

Rule Path
Disallow
Allow /*

mediapartners-google*

Rule Path
Disallow
Allow /*
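
An empty Disallow value restricts nothing, and Allow /* matches every path, so the googlebot-image and mediapartners-google* groups permit the whole site, while the duggmirror group blocks it entirely. A small sketch of how Allow and Disallow could be resolved, assuming the RFC 9309 convention that the longest matching pattern wins and Allow wins ties; the helper names are hypothetical:

```python
import re


def _matches(pattern: str, path: str) -> bool:
    """Match a robots.txt pattern against a path; an empty pattern matches nothing."""
    if not pattern:
        return False
    anchored = pattern.endswith("$")
    core = pattern[:-1] if anchored else pattern
    regex = ".*".join(re.escape(piece) for piece in core.split("*"))
    return re.match(regex + ("$" if anchored else ""), path) is not None


def is_allowed(path: str, rules: list[tuple[str, str]]) -> bool:
    """Resolve (kind, pattern) rules: longest match wins, Allow wins ties."""
    best_len, allowed = -1, True  # no matching rule means the path is allowed
    for kind, pattern in rules:
        if not _matches(pattern, path):
            continue
        is_allow = kind == "allow"
        if len(pattern) > best_len or (len(pattern) == best_len and is_allow):
            best_len, allowed = len(pattern), is_allow
    return allowed


# Rules as listed for googlebot-image and mediapartners-google* above.
OPEN_GROUP = [("disallow", ""), ("allow", "/*")]
# Rule as listed for duggmirror above.
DUGGMIRROR = [("disallow", "/")]
```

With these inputs, is_allowed("/images/a.png", OPEN_GROUP) is true and is_allowed("/images/a.png", DUGGMIRROR) is false.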

Other Records

Field Value
sitemap https://www.sapikotak.id/sitemap_index.xml

Comments

  • disallow all files in these directories
  • disallow all files ending with these extensions
  • disallow all files with ? in url
  • disable duggmirror
  • allow google image bot to search all images
  • allow adsense bot on entire site