gossip-addict.com
robots.txt

Robots Exclusion Standard data for gossip-addict.com

Resource Scan

Scan Details

Site Domain gossip-addict.com
Base Domain gossip-addict.com
Scan Status Ok
Last Scan2024-11-08T11:39:53+00:00
Next Scan 2024-11-15T11:39:53+00:00

Last Scan

Scanned2024-11-08T11:39:53+00:00
URL https://gossip-addict.com/robots.txt
Redirect https://www.gossip-addict.com/robots.txt
Redirect Domain www.gossip-addict.com
Redirect Base gossip-addict.com
Domain IPs 178.32.151.192
Redirect IPs 178.32.151.192
Response IP 178.32.151.192
Found Yes
Hash d85f828329946376f53b76c021ce6c67eec475ec50f0fcae81f6aacbe3b30c06
SimHash ad26da7a4339

Groups

baiduspider
yisouspider
petalbot
bytespider
sogou web spider
sogou inst spider

Rule Path
Disallow /

facebookbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 5

*

Rule Path
Disallow /scores*
Disallow /home*
Disallow /*flux*
Disallow /*widget*
Disallow /*clubByName*
Disallow /*clubByFiltre*
Disallow /*modalite
Disallow /*fbshare
Disallow /*CacheiPhone
Disallow /*.php
Disallow /*newsFlux*
Disallow /*photosFlux*
Disallow /*videosFlux*
Disallow /*news_*
Disallow /*photos_*
Disallow /*videos_*
Disallow /*mraid*
Disallow /*archives
Allow /$
Allow /*article*
Allow /photos$
Allow /videos$
Allow /news*
Allow /photos*
Allow /videos*
Allow /*routing*

Warnings

  • 1 invalid line.