ipendidikan.my
robots.txt

Robots Exclusion Standard data for ipendidikan.my

Resource Scan

Scan Details

Site Domain ipendidikan.my
Base Domain ipendidikan.my
Scan Status Ok
Last Scan2025-01-10T13:01:10+00:00
Next Scan 2025-01-17T13:01:10+00:00

Last Scan

Scanned2025-01-10T13:01:10+00:00
URL https://ipendidikan.my/robots.txt
Redirect https://www.ipendidikan.my/robots.txt
Redirect Domain www.ipendidikan.my
Redirect Base ipendidikan.my
Domain IPs 104.21.65.166, 172.67.147.67, 2606:4700:3030::6815:41a6, 2606:4700:3037::ac43:9343
Redirect IPs 104.21.65.166, 172.67.147.67, 2606:4700:3030::6815:41a6, 2606:4700:3037::ac43:9343
Response IP 104.21.65.166
Found Yes
Hash 665683bda8731d7553aa7d48d2885b9cc8212946a806f3da846bb7f22127ecd4
SimHash 6bc4d9d66031

Groups

*

Rule Path
Allow /wp-admin/admin-ajax.php
Allow /*/*.css
Allow /*/*.js
Disallow /wp-admin/
Disallow /wp-includes/
Disallow /readme.html
Disallow /license.txt
Disallow /xmlrpc.php
Disallow /wp-login.php
Disallow /wp-register.php
Disallow *?attachment_id=
Disallow /*~*
Disallow /*~

googlebot

Rule Path
Allow /

googlebot-news

Rule Path
Allow /

googlebot-image

Rule Path
Allow /wp-content/uploads/

googlebot-video

Rule Path
Allow /

mediapartners-google

Rule Path
Allow /

adsbot-google

Rule Path
Allow /

adsbot-google-mobile

Rule Path
Allow /

baiduspider

Rule Path
Disallow /

baiduspider-image

Rule Path
Disallow /wp-content/uploads/

Other Records

Field Value
sitemap https://www.ipendidikan.my/sitemap_index.xml