feedspot.com
robots.txt

Robots Exclusion Standard data for feedspot.com

Resource Scan

Scan Details

Site Domain feedspot.com
Base Domain feedspot.com
Scan Status Ok
Last Scan 2024-05-18T04:07:59+00:00
Next Scan 2024-06-17T04:07:59+00:00
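
The gap between the two timestamps above can be checked with a short sketch. It only computes the difference between the reported values; whether the scanner always uses this interval is not stated in the report.

```python
from datetime import datetime

# Timestamps copied from the Scan Details above.
last_scan = datetime.fromisoformat("2024-05-18T04:07:59+00:00")
next_scan = datetime.fromisoformat("2024-06-17T04:07:59+00:00")

# Difference between the scheduled next scan and the last scan.
interval = next_scan - last_scan
print(interval.days)
```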

Last Scan

Scanned 2024-05-18T04:07:59+00:00
URL https://feedspot.com/robots.txt
Redirect https://www.feedspot.com/robots.txt
Redirect Domain www.feedspot.com
Redirect Base feedspot.com
Domain IPs 18.244.214.12, 18.244.214.14, 18.244.214.19, 18.244.214.97
Redirect IPs 35.165.175.52, 54.68.109.196
Response IP 54.68.109.196
Found Yes
Hash 1cb962bccb1ecd8a8556e1aca3f2401da40658282202b88166953a9c93dbf0f6
SimHash 6208ce206b43
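
The Hash value is 64 hexadecimal characters long, which is consistent with a SHA-256 digest of the fetched file, although the report does not name the algorithm. A minimal sketch under that assumption, where content_hash and the sample body are hypothetical and used only for illustration:

```python
import hashlib

# Hash value copied from the report above.
REPORTED_HASH = "1cb962bccb1ecd8a8556e1aca3f2401da40658282202b88166953a9c93dbf0f6"


def content_hash(body: bytes) -> str:
    """Return the SHA-256 hex digest of a robots.txt body.

    Assumption: the scanner hashes the raw response bytes with SHA-256.
    The report does not confirm this.
    """
    return hashlib.sha256(body).hexdigest()


# Hypothetical body; the real file content is not reproduced in the report,
# so this digest is not expected to equal REPORTED_HASH.
sample_body = b"User-agent: psbot\nDisallow: /\n"
digest = content_hash(sample_body)

# Both strings have the same length, as any two SHA-256 hex digests do.
print(len(digest) == len(REPORTED_HASH))
```

Comparing the digest of a fresh fetch against a stored value is one simple way to detect that the file changed between scans.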

Groups

psbot

Rule Path
Disallow /

cfnetwork

Rule Path
Disallow /

microsoft url control

Rule Path
Disallow /

java

Rule Path
Disallow /

httrack off-line browser

Rule Path
Disallow /

mbcrawler

Rule Path
Disallow /

yandex

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 10

bingbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 10

ahrefsbot

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

petalbot

Rule Path
Disallow /

linkedinbot

Rule Path
Disallow /infiniterss*

*

Rule Path
Disallow /search
Disallow /landing
Disallow /landing.php
Disallow /folder
Disallow /ct.php
Disallow /api
Disallow /register
Disallow /url
Disallow /fs/img
Disallow /fs/post
Allow /login
Allow /fs/about
Allow /fs/publisher
Allow /fs/contact

mediapartners-google*

Rule Path
Disallow (empty)
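
The groups above can be explored with Python's standard urllib.robotparser. The sketch below rebuilds a few of the listed groups as robots.txt lines (an abbreviated reconstruction, not the original file) and queries them. The helper name allowed and the agent name ExampleBot are hypothetical. The linkedinbot group is left out because this parser matches path prefixes literally and does not interpret the trailing * in /infiniterss*.

```python
from urllib.robotparser import RobotFileParser

# Abbreviated reconstruction of some groups listed above.
# Assumption: the original file uses standard "Field: value" syntax.
ROBOTS_LINES = """\
User-agent: ahrefsbot
Disallow: /

User-agent: bingbot
Crawl-delay: 10

User-agent: *
Disallow: /search
Disallow: /api
Disallow: /fs/img
Allow: /login
Allow: /fs/about
""".splitlines()

parser = RobotFileParser()
parser.parse(ROBOTS_LINES)

BASE = "https://www.feedspot.com"


def allowed(agent: str, path: str) -> bool:
    """Hypothetical helper: may `agent` fetch `path` under these rules?"""
    return parser.can_fetch(agent, BASE + path)


# ahrefsbot is blocked from the whole site.
print(allowed("ahrefsbot", "/"))
# An agent without its own group falls back to the * group.
print(allowed("ExampleBot", "/search"))
print(allowed("ExampleBot", "/fs/about"))
# bingbot has its own group with no path rules, only a crawl delay.
print(allowed("bingbot", "/search"))
print(parser.crawl_delay("bingbot"))
```

Note that a group written for a specific agent replaces the * group for that agent rather than adding to it, which is why bingbot is not bound by the * group's Disallow rules in this sketch. Other parsers may resolve Allow/Disallow conflicts differently (for example by longest match), so results for overlapping rules can vary between implementations.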