ar.timesofisrael.com
robots.txt

Robots Exclusion Standard data for ar.timesofisrael.com

Archived Snapshots

Resource Scan

Scan Details

Site Domain	ar.timesofisrael.com
Base Domain	timesofisrael.com
Scan Status	Failed
Failure Stage	Fetching resource.
Failure Reason	Server returned a client error.
Last Scan	2024-05-15T04:39:30+00:00
Next Scan	2024-07-14T04:39:30+00:00

Last Successful Scan

Scanned	2023-10-26T04:37:59+00:00
URL	https://ar.timesofisrael.com/robots.txt
Domain IPs	104.18.6.47, 104.18.7.47, 2606:4700::6812:62f, 2606:4700::6812:72f
Response IP	104.18.7.47
Found	Yes
Hash	676a1a8dd67ef27d973ea15669a47ec83c0b63a0859ab61480d21111d1301126
SimHash	ca0548d0da75

Groups

googlebot-image

Rule	Path
Disallow
Allow	/*

Rule

Path

Disallow

Allow

/*

mediapartners-google*

Rule	Path
Disallow

Rule

Path

Disallow

duggmirror

Rule	Path
Disallow	/

Rule

Path

Disallow

/

twitterbot

Rule	Path
Disallow
Allow	/*

Rule

Path

Disallow

Allow

/*

googlebot-news

Rule	Path
Disallow	/spotlight/
Disallow	/announcements/

Rule

Path

Disallow

/spotlight/

Disallow

/announcements/

msnbot

No rules defined. All paths allowed.

Other Records

Field	Value
crawl-delay	3

Field

Value

crawl-delay

3

*

Rule	Path
Disallow	/cgi-bin/
Disallow	/wp-admin/
Disallow	/wp-login.php
Disallow	/wp-includes/
Disallow	/wp-content/cache/
Disallow	/trackback/
Disallow	/rss-feed/
Disallow	/feed/
Disallow	/comments/
Disallow	/cities/
Disallow	/types/
Disallow	/anniversaries/
Disallow	/companies/
Disallow	/test-hp/
Disallow	preview%3Dtrue
Disallow	/?p=*
Disallow	3933714/TOI

Rule

Path

Disallow

/cgi-bin/

Disallow

/wp-admin/

Disallow

/wp-login.php

Disallow

/wp-includes/

Disallow

/wp-content/cache/

Disallow

/trackback/

Disallow

/rss-feed/

Disallow

/feed/

Disallow

/comments/

Disallow

/cities/

Disallow

/types/

Disallow

/anniversaries/

Disallow

/companies/

Disallow

/test-hp/

Disallow

*preview%3Dtrue*

Disallow

/?p=*

Disallow

*3933714/TOI*

Back to top

Other Records

Field	Value
sitemap	https://ar.timesofisrael.com/sitemap_index.xml

Field

Value

sitemap

https://ar.timesofisrael.com/sitemap_index.xml

Back to top

Comments

Google Image
Google AdSense
digg mirror
Twiiter
Google News
MSN
global

Back to top

ar.timesofisrael.comrobots.txt

Resource Scan

Scan Details

Last Successful Scan

Groups

googlebot-image

mediapartners-google*

duggmirror

twitterbot

googlebot-news

msnbot

Other Records

*

Other Records

Comments

ar.timesofisrael.com
robots.txt