athinorama.gr
robots.txt

Robots Exclusion Standard data for athinorama.gr

Resource Scan

Scan Details

Site Domain athinorama.gr
Base Domain athinorama.gr
Scan Status Ok
Last Scan2024-05-06T03:01:52+00:00
Next Scan 2024-05-13T03:01:52+00:00

Last Scan

Scanned2024-05-06T03:01:52+00:00
URL https://athinorama.gr/robots.txt
Redirect https://www.athinorama.gr/robots.txt
Redirect Domain www.athinorama.gr
Redirect Base athinorama.gr
Domain IPs 104.26.2.215, 104.26.3.215, 172.67.68.234, 2606:4700:20::681a:2d7, 2606:4700:20::681a:3d7, 2606:4700:20::ac43:44ea
Redirect IPs 23.32.29.9, 96.17.180.51
Response IP 23.52.40.106
Found Yes
Hash a3ec06f0c958639477fc53e14e33257e2957037ef2a9334c43e2671a787c6338
SimHash 2b3bc84082b4

Groups

*

Rule Path
Disallow
Disallow /Api/*
Disallow /api/*
Disallow /Search*
Disallow /search*
Disallow /Newsletter*
Disallow /newsletter*

msnbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 120

gptbot

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.athinorama.gr/sitemap/movies
sitemap https://www.athinorama.gr/sitemap/theatres
sitemap https://www.athinorama.gr/sitemap/music
sitemap https://www.athinorama.gr/sitemap/restaurantsandclubs
sitemap https://www.athinorama.gr/sitemap/allnews
sitemap https://www.athinorama.gr/sitemap/googlenews
sitemap https://www.athinorama.gr/sitemap/categories

Comments

  • Block ChatGPT etc.