harapanrakyat.com
robots.txt

Robots Exclusion Standard data for harapanrakyat.com

Archived Snapshots

Resource Scan

Scan Details

Site Domain	harapanrakyat.com
Base Domain	harapanrakyat.com
Scan Status	Ok
Last Scan	2024-11-05T06:06:56+00:00
Next Scan	2024-11-12T06:06:56+00:00

Last Scan

Scanned	2024-11-05T06:06:56+00:00
URL	https://harapanrakyat.com/robots.txt
Domain IPs	104.26.6.204, 104.26.7.204, 172.67.69.119, 2606:4700:20::681a:6cc, 2606:4700:20::681a:7cc, 2606:4700:20::ac43:4577
Response IP	104.26.6.204
Found	Yes
Hash	f26f1f66eff03f2aa645162eb23caf53a14d43ad8d0c1b94569e82521880c9e0
SimHash	cf24d726b4a1

Groups

*

Rule	Path
Disallow	/wp-admin/
Allow	/wp-admin/admin-ajax.php

Rule

Path

Disallow

/wp-admin/

Allow

/wp-admin/admin-ajax.php

*

Rule	Path
Disallow	/?s=

Rule

Path

Disallow

/?s=

*

Rule	Path
Disallow	/search/

Rule

Path

Disallow

/search/

mediapartners-google

Rule	Path
Allow	/

Rule

Path

Allow

adsbot-google

Rule	Path
Allow	/

Rule

Path

Allow

googlebot-mobile

Rule	Path
Allow	/

Rule

Path

Allow

googlebot-image

Rule	Path
Allow	/

Rule

Path

Allow

googlebot-news

Rule	Path
Allow	/

Rule

Path

Allow

googlebot

Rule	Path
Allow	/

Rule

Path

Allow

nuclei

Rule	Path
Disallow	/

Rule

Path

Disallow

wikido

Rule	Path
Disallow	/

Rule

Path

Disallow

riddler

Rule	Path
Disallow	/

Rule

Path

Disallow

petalbot

Rule	Path
Disallow	/

Rule

Path

Disallow

zoominfobot

Rule	Path
Disallow	/

Rule

Path

Disallow

amazonbot

Rule	Path
Disallow	/

Rule

Path

Disallow

node/simplecrawler

Rule	Path
Disallow	/

Rule

Path

Disallow

cazoodlebot

Rule	Path
Disallow	/

Rule

Path

Disallow

dotbot/1.0

Rule	Path
Disallow	/

Rule

Path

Disallow

gigabot

Rule	Path
Disallow	/

Rule

Path

Disallow

barkrowler

Rule	Path
Disallow	/

Rule

Path

Disallow

blexbot

Rule	Path
Disallow	/

Rule

Path

Disallow

magpie-crawler

Rule	Path
Disallow	/

Rule

Path

Disallow

chatgpt-user

Rule	Path
Disallow	/

Rule

Path

Disallow

openai

Rule	Path
Disallow	/

Rule

Path

Disallow

ccbot

Rule	Path
Disallow	/

Rule

Path

Disallow

gptbot

Rule	Path
Disallow	/

Rule

Path

Disallow

Other Records

Field	Value
sitemap	https://www.harapanrakyat.com/sitemap_index.xml
sitemap	https://www.harapanrakyat.com/news-sitemap.xml

Field

Value

sitemap

https://www.harapanrakyat.com/sitemap_index.xml

sitemap

https://www.harapanrakyat.com/news-sitemap.xml

harapanrakyat.comrobots.txt

Resource Scan

Scan Details

Last Scan

Groups

*

*

*

mediapartners-google

adsbot-google

googlebot-mobile

googlebot-image

googlebot-news

googlebot

nuclei

wikido

riddler

petalbot

zoominfobot

amazonbot

node/simplecrawler

cazoodlebot

dotbot/1.0

gigabot

barkrowler

blexbot

magpie-crawler

chatgpt-user

openai

ccbot

gptbot

Other Records

harapanrakyat.com
robots.txt