newscaf.com
robots.txt

Robots Exclusion Standard data for newscaf.com

Archived Snapshots

Resource Scan

Scan Details

Site Domain	newscaf.com
Base Domain	newscaf.com
Scan Status	Ok
Last Scan	2026-01-09T08:03:51+00:00
Next Scan	2026-01-16T08:03:51+00:00

Last Scan

Scanned	2026-01-09T08:03:51+00:00
URL	https://newscaf.com/robots.txt
Redirect	https://www.newscaf.com/robots.txt
Redirect Domain	www.newscaf.com
Redirect Base	newscaf.com
Domain IPs	198.72.102.251
Redirect IPs	198.72.102.251
Response IP	198.72.102.251
Found	Yes
Hash	a22adabb915b9b5b1bcc0f48e718fd3fa90ad8720ccec0d461ef29df590421e2
SimHash	4a35fd61c211

Groups

*

Rule	Path
Allow	/

Rule

Path

Allow

googlebot-image

Rule	Path
Disallow	/

Rule

Path

Disallow

*

Rule	Path
Disallow	/out/

Rule

Path

Disallow

/out/

slurp

Rule	Path
Disallow	/

Rule

Path

Disallow

mj12bot

Rule	Path
Disallow	/

Rule

Path

Disallow

wotbox

Rule	Path
Disallow	/

Rule

Path

Disallow

aboundexbot

Rule	Path
Disallow	/

Rule

Path

Disallow

jikespider

Rule	Path
Disallow	/

Rule

Path

Disallow

sentibot

Rule	Path
Disallow	/

Rule

Path

Disallow

ahrefsbot

Rule	Path
Disallow	/

Rule

Path

Disallow

blexbot

Rule	Path
Disallow	/

Rule

Path

Disallow

rogerbot

Rule	Path
Disallow	/

Rule

Path

Disallow

semrushbot

Rule	Path
Disallow	/

Rule

Path

Disallow

semrushbot-sa

Rule	Path
Disallow	/

Rule

Path

Disallow

baiduspider

Rule	Path
Disallow	/

Rule

Path

Disallow

yandexbot

Rule	Path
Disallow	/

Rule

Path

Disallow

linguee

Rule	Path
Disallow	/

Rule

Path

Disallow

applebot

Rule	Path
Disallow	/

Rule

Path

Disallow

*

No rules defined. All paths allowed.

Other Records

Field	Value
crawl-delay	15

Field

Value

crawl-delay

newscaf.comrobots.txt

Resource Scan

Scan Details

Last Scan

Groups

*

googlebot-image

*

slurp

mj12bot

wotbox

aboundexbot

jikespider

sentibot

ahrefsbot

blexbot

rogerbot

semrushbot

semrushbot-sa

baiduspider

yandexbot

linguee

applebot

*

Other Records

newscaf.com
robots.txt