gnet.globes.co.il
robots.txt

Robots Exclusion Standard data for gnet.globes.co.il

Resource Scan

Scan Details

Site Domain gnet.globes.co.il
Base Domain globes.co.il
Scan Status Failed
Failure StageFetching resource.
Failure ReasonRequest timed out.
Last Scan2024-11-08T04:06:43+00:00
Next Scan 2025-02-06T04:06:43+00:00

Last Successful Scan

Scanned2024-07-12T03:50:51+00:00
URL https://gnet.globes.co.il/robots.txt
Domain IPs 80.70.128.53
Response IP 80.70.128.53
Found Yes
Hash 6896e680cc0fafd1fb03cdfbf491cc9101e477ae20a0e0f86ac1e5b459557749
SimHash 8b3f19082b90

Groups

telegrambot (like twitterbot)

Rule Path
Disallow /

*

Rule Path
Disallow /cgi-bin/
Disallow /adstream/
Disallow /home_scripts
Disallow /7263/
Disallow %40CONTENT-REF
Disallow /news/undefined/
Disallow /en/undefined/
Disallow /bulletin/
Disallow /shared/
Disallow /apps/
Allow /bulletin/divors/nirim.html

Other Records

Field Value
sitemap http://www.globes.co.il/data/webservices/google-maps.ashx
sitemap http://www.globes.co.il/data/webservices/google-maps.ashx?language=he
sitemap http://www.globes.co.il/data/webservices/google-maps.ashx?language=en

Comments

  • Robots.txt file
  • All robots will spider the domain