rferl.org
robots.txt

Robots Exclusion Standard data for rferl.org

Resource Scan

Scan Details

Site Domain rferl.org
Base Domain rferl.org
Scan Status Ok
Last Scan 2024-09-25T09:44:35+00:00
Next Scan 2024-10-09T09:44:35+00:00

Last Scan

Scanned 2024-09-25T09:44:35+00:00
URL https://rferl.org/robots.txt
Redirect https://www.rferl.org/robots.txt
Redirect Domain www.rferl.org
Redirect Base rferl.org
Domain IPs 23.32.29.107, 23.32.29.91, 2600:1413:b000:1b::17d7:71a, 2600:1413:b000:1b::17d7:71b
Redirect IPs 23.210.100.44, 2600:1413:b000:680::1317, 2600:1413:b000:699::1317
Response IP 104.103.151.177
Found Yes
Hash a07bed8c64f2cf26d62d4ba59c6393818972bbc198795768e979245fe04edc3f
SimHash 700cc84eea13

Groups

*

Rule Path
Disallow /z/*/*/*/*
Disallow /ebar*
Disallow /api/*
Disallow /tv/*/*/*/*/*
Disallow /radio/*/*/*/*/*
Disallow /schedule/*/*/*/*/*
Disallow /*?p=*
Disallow /comments/*
Disallow /embed/*
Disallow /s?k=*
Disallow /navigation.html
Disallow /captcha/iframe.html
Disallow /office365/login.html
Disallow /podcast/sublink/*
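
The paths above rely on `*` wildcards. As a rough illustration of how such patterns are commonly evaluated (following the matching rules described in RFC 9309, where `*` matches any run of characters, a trailing `$` anchors the end, and everything else is a literal prefix match), here is a minimal Python sketch. The helper names `rule_to_regex` and `rule_matches` are made up for this example, the sample URLs are hypothetical, and real crawlers may differ in details such as percent-encoding.

```python
import re

def rule_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    Assumption: '*' matches any sequence of characters and a trailing '$'
    anchors the end of the path; all other characters are literal.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape the literal pieces (so '?' and '.' are not regex operators)
    # and join them with '.*' in place of each '*'.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))

def rule_matches(pattern, path):
    """Return True if the pattern matches the path from its first character."""
    return rule_to_regex(pattern).match(path) is not None

# Hypothetical paths checked against rules from the group above.
print(rule_matches("/api/*", "/api/v1/items"))          # True
print(rule_matches("/z/*/*/*/*", "/z/1/2"))             # False: too few segments
print(rule_matches("/*?p=*", "/a/article.html?p=2"))    # True
```

Because matching is prefix-based, a rule without wildcards such as `/navigation.html` also covers any longer path that begins with that string.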

ahrefsbot

Rule Path
Disallow /
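
With two groups present, a crawler has to decide which one applies to it. The sketch below shows one plausible selection strategy: pick the group whose name appears in the crawler's user-agent string (case-insensitively, preferring the longest name) and fall back to `*` otherwise. The `GROUPS` dictionary, the `select_group` function, and the sample user-agent strings are illustrative assumptions, not the scanner's actual logic.

```python
# Disallow rules per group, abbreviated from the scan above.
GROUPS = {
    "*": ["/api/*", "/comments/*", "/embed/*"],
    "ahrefsbot": ["/"],
}

def select_group(groups, user_agent):
    """Return the rule list of the most specific group for this user agent.

    Assumption: a named group applies when its name occurs anywhere in the
    lower-cased user-agent string; the longest such name wins.
    """
    ua = user_agent.lower()
    named = [name for name in groups if name != "*" and name in ua]
    if named:
        return groups[max(named, key=len)]
    return groups.get("*", [])

# Hypothetical user-agent strings.
print(select_group(GROUPS, "Mozilla/5.0 (compatible; AhrefsBot/7.0)"))  # ['/']
print(select_group(GROUPS, "ExampleBot/1.0"))  # falls back to the '*' rules
```

Under this reading, `ahrefsbot` is blocked from the whole site, while every other crawler follows only the `*` group.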

Other Records

Field Value
sitemap https://www.rferl.org/sitemap.xml

Warnings

  • `clean-param` is not a known field.
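
`Clean-param` is a non-standard directive documented by Yandex and is not part of RFC 9309, which is the likely reason it is flagged here. A warning of this kind could be produced by a check along the following lines. This is only a sketch: the `KNOWN_FIELDS` set, the `unknown_fields` function, and the sample file (including the `Clean-param` value) are hypothetical and do not reproduce the site's actual robots.txt or the scanner's real field list.

```python
# Assumed set of recognized field names; a real scanner may accept more.
KNOWN_FIELDS = {"user-agent", "allow", "disallow", "sitemap"}

def unknown_fields(robots_txt):
    """Return unrecognized field names in order of first appearance."""
    unknown = []
    for raw in robots_txt.splitlines():
        line = raw.split("#", 1)[0].strip()   # drop comments and whitespace
        if ":" not in line:
            continue                          # blank or malformed line
        field = line.split(":", 1)[0].strip().lower()
        if field not in KNOWN_FIELDS and field not in unknown:
            unknown.append(field)
    return unknown

# Hypothetical input; the Clean-param value is invented for illustration.
SAMPLE = """\
User-agent: *
Disallow: /api/*
Clean-param: example_param /example/path
Sitemap: https://www.rferl.org/sitemap.xml
"""

print(unknown_fields(SAMPLE))  # ['clean-param']
```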