bahasaenglish.com
robots.txt

Robots Exclusion Standard data for bahasaenglish.com

Resource Scan

Scan Details

Site Domain bahasaenglish.com
Base Domain bahasaenglish.com
Scan Status Ok
Last Scan2024-11-14T02:31:49+00:00
Next Scan 2024-11-21T02:31:49+00:00

Last Scan

Scanned2024-11-14T02:31:49+00:00
URL https://bahasaenglish.com/robots.txt
Domain IPs 104.21.82.195, 172.67.162.212, 2606:4700:3034::6815:52c3, 2606:4700:3037::ac43:a2d4
Response IP 172.67.162.212
Found Yes
Hash 52cdffab5e9ba90e3dbf3f878a6f83b3590a48f35e196df2cd8bdace74068361
SimHash 5b1c46d2a63b

Groups

*

Rule Path
Disallow /wp-admin
Disallow /wp-includes
Disallow /wp-content/plugins
Disallow /wp-content/cache
Disallow /wp-json/
Disallow /xmlrpc.php
Disallow /readme.html
Disallow /*?
Disallow /page/*
Disallow /?s=
Allow /*.css
Allow /*.js

yandex
baiduspider
sogou web spider
cliqzbot
mappy
mojeekbot

Rule Path
Disallow */*

special_archiver
proximic

Rule Path
Disallow */*

twitterbot

Rule Path
Disallow */*

dotbot
semrushbot
ahrefsbot
mj12bot
spbot
extlinksbot

Rule Path
Disallow */*

mail.ru_bot

Rule Path
Disallow */*

Other Records

Field Value
sitemap https://www.bahasaenglish.com/sitemap_index.xml

Comments

  • Basic Bots Configuration
  • Block Unwanted Search Engine Bots
  • Block Web Stats Bots
  • Block Social Media Bots
  • Block SEO Tools Bots
  • Block Annoying Bots

Warnings

  • `host` is not a known field.