journalist.id
robots.txt

Robots Exclusion Standard data for journalist.id

Resource Scan

Scanned	2024-09-26T19:46:57+00:00
URL	https://journalist.id/robots.txt
Domain IPs	103.217.144.250
Response IP	103.217.144.250
Found	Yes
Hash	2ae93ce485a1a90b48f6b405985fa1ba9651e0f10594d0633420380a3504b57c
SimHash	0d3c5c74c384

Rule	Path
Disallow	/

Rule

Path

Disallow

/

Rule	Path
Disallow	/api/
Disallow	/pdf-post/*

Rule

Path

Disallow

/api/

Disallow

/pdf-post/*

Field	Value
crawl-delay	1800

Field

Value

crawl-delay

1800

Back to top

Field	Value
sitemap	https://journalist.id/sitemap.xml

Field

Value

sitemap

https://journalist.id/sitemap.xml

Back to top