claro.eg
robots.txt

Robots Exclusion Standard data for claro.eg

Resource Scan

Scan Details

Site Domain claro.eg
Base Domain claro.eg
Scan Status Ok
Last Scan 2025-12-18T13:45:41+00:00
Next Scan 2026-01-01T13:45:41+00:00

Last Scan

Scanned 2025-12-18T13:45:41+00:00
URL https://www.claro.eg/robots.txt
Domain IPs 20.50.64.9, 2603:1020:5:5::47
Response IP 20.50.64.9
Found Yes
Hash b60fb9846f676332eacf99a43f6c41699e8967b99da54f3f46a9df1d779d8418
SimHash 9148dba1cab5

Groups

*

Rule Path
Disallow *?*sortBy=*
Disallow *?*minPrice=*
Disallow *?*maxPrice=*
Disallow *?*minSize=*
Disallow *?*maxSize=*
Disallow *?*filter-Halls=*
Disallow *?*filter-Floor=*
Disallow *?*payment-type=*
Allow *?page=*
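
To illustrate how the wildcard rules in the `*` group might be evaluated, here is a minimal Python sketch. It assumes RFC 9309 style matching: `*` matches any character sequence, a trailing `$` anchors the end, the longest matching pattern wins, and Allow wins a tie. The helper names (`pattern_to_regex`, `is_allowed`) and the sample paths are hypothetical, and real crawlers may treat patterns that do not start with `/` differently.

```python
import re

# Rules copied from the "*" group above, in listed order.
RULES = [
    ("disallow", "*?*sortBy=*"),
    ("disallow", "*?*minPrice=*"),
    ("disallow", "*?*maxPrice=*"),
    ("disallow", "*?*minSize=*"),
    ("disallow", "*?*maxSize=*"),
    ("disallow", "*?*filter-Halls=*"),
    ("disallow", "*?*filter-Floor=*"),
    ("disallow", "*?*payment-type=*"),
    ("allow", "*?page=*"),
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    '*' becomes '.*'; a trailing '$' anchors the end of the URL.
    Everything else is matched literally.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(path_and_query):
    """Return True if the path (including query string) may be crawled.

    Assumption: longest matching pattern wins, Allow wins on equal length,
    and a URL that matches no rule is allowed.
    """
    best = None  # (pattern length, is_allow)
    for kind, pattern in RULES:
        if pattern_to_regex(pattern).match(path_and_query):
            candidate = (len(pattern), kind == "allow")
            if best is None or candidate > best:
                best = candidate
    return True if best is None else best[1]


if __name__ == "__main__":
    # Hypothetical paths, not taken from the scanned site.
    for url in ["/listings?sortBy=price", "/listings?page=2", "/about"]:
        print(url, is_allowed(url))
```

Under these assumptions, a URL carrying both `page=` and a filter parameter such as `sortBy=` is still disallowed, because the Disallow pattern is longer than the Allow pattern.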

sitecheck.internetseer.com

Rule Path
Disallow /

zealbot

Rule Path
Disallow /

msiecrawler

Rule Path
Disallow /

webstripper

Rule Path
Disallow /

webcopier

Rule Path
Disallow /

fetch

Rule Path
Disallow /

offline explorer

Rule Path
Disallow /

teleport

Rule Path
Disallow /

teleportpro

Rule Path
Disallow /

webzip

Rule Path
Disallow /

httrack

Rule Path
Disallow /

xenu

Rule Path
Disallow /

larbin

Rule Path
Disallow /

zyborg

Rule Path
Disallow /

screaming frog seo spider

Rule Path
Disallow /

facebookexternalhit/1.1

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 15

facebookexternalhit/1.0

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 15

facebookexternalhit/*

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 15

Other Records

Field Value
sitemap https://www.claro.eg/sitemap.xml

Comments

  • Block filter parameters
  • Allow pagination
  • Sitemap reference
  • Block common scraper bots

Warnings

  • 3 invalid lines.