horoscope.walla.co.il
robots.txt

Robots Exclusion Standard data for horoscope.walla.co.il

Resource Scan

Scan Details

Site Domain horoscope.walla.co.il
Base Domain walla.co.il
Scan Status Failed
Failure StageFetching resource.
Failure ReasonServer returned a client error.
Last Scan2024-10-22T12:35:04+00:00
Next Scan 2024-11-21T12:35:04+00:00

Last Successful Scan

Scanned2024-09-23T12:33:05+00:00
URL https://horoscope.walla.co.il/robots.txt
Domain IPs 52.84.229.115, 52.84.229.47, 52.84.229.56, 52.84.229.82
Response IP 52.84.229.56
Found Yes
Hash 9d1123e172b29798e5e9bdb491b0ff46da8067d39efd34581b3283d4586f292f
SimHash c34a12cd06f1

Groups

*

Rule Path
Disallow *%26timeStamp%3D
Disallow *%40entity
Disallow *%40opinion
Disallow *%40placation
Disallow *%40poll.results
Disallow /%26tagfly%3D1
Disallow *%40search%26
Disallow *hotOrNot.commit
Disallow /43010785/*
Disallow */userfeedback/facebook.stream.publish
Disallow *mobile%3D*
Disallow *?0*
Disallow *?feature
Disallow *layout%3D
Disallow /*?mediazone*
Disallow /*%26mediazone*
Disallow *navindex%3D*
Disallow /*?fallback*
Disallow /*?fb_comment_id*
Disallow /player.html*
Disallow /rm*
Disallow *?_w_open_tlk
Disallow *?year=20*

mediapartners-google

Rule Path
Allow /

Other Records

Field Value
sitemap https://horoscope.walla.co.il/sitemap/8/newsmap.xml
sitemap https://horoscope.walla.co.il/sitemap/8/index.xml

Comments

  • robots.txt - 2018-03-13
  • robots file for domain: https://horoscope.walla.co.il/