stripersonline.com
robots.txt

Robots Exclusion Standard data for stripersonline.com

Resource Scan

Scan Details

Site Domain stripersonline.com
Base Domain stripersonline.com
Scan Status Ok
Last Scan2024-10-04T02:31:40+00:00
Next Scan 2024-10-11T02:31:40+00:00

Last Scan

Scanned2024-10-04T02:31:40+00:00
URL https://stripersonline.com/robots.txt
Domain IPs 72.52.250.24
Response IP 72.52.250.24
Found Yes
Hash 40bfe55d5a43a73f38bcb911bb0b3eeeca36941308c71abee80d0e930bec51e0
SimHash f9160921c888

Groups

amazonbot
applebot
applebot-extended
bytespider
ccbot
chatgpt-user
claude-web
claudebot
diffbot
facebookbot
friendlycrawler
gptbot
googleother-image
googleother-video
icc-crawler
imagesiftbot
meta-externalagent
meta-externalfetcher
gptbot
oai-searchbot
perplexitybot
petalbot
scrapy
timpibot
velenpublicwebcrawler
webzio-extended
youbot
anthropic-ai
cohere-ai
facebookexternalhit
img2dataset
omgili
omgilibot

Rule Path
Disallow /

seekportbot

Rule Path
Disallow /

yandeximages

Rule Path
Disallow /

yandexbot

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

blexbot

Rule Path
Disallow /surftalk/gallery/*

baiduspider

Rule Path
Disallow /

baidu

Rule Path
Disallow /

seznambot

Rule Path
Disallow /

mail.ru_bot

Rule Path
Disallow /

linkwalker

Rule Path
Disallow /

*

Rule Path
Disallow /surftalk/startTopic/
Disallow /surftalk/discover/unread/
Disallow /surftalk/markallread/
Disallow /surftalk/online/
Disallow /surftalk/discover/
Disallow /surftalk/leaderboard/
Disallow /surftalk/search/
Disallow /surftalk/tags/
Disallow /surftalk/*?advancedSearchForm=
Disallow /surftalk/register/
Disallow /surftalk/lostpassword/
Disallow /surftalk/login/
Disallow /surftalk/*?sortby=
Disallow /surftalk/*?filter=
Disallow /surftalk/*?tab=
Disallow /surftalk/*?do=
Disallow /surftalk/*ref%3D
Disallow /surftalk/*?forumId*

Other Records

Field Value
sitemap https://www.stripersonline.com/surftalk/sitemap.php
sitemap https://www.stripersonline.com/surftalk/sitemap.php

Comments

  • Sitemap
  • User-agent: Google-Extended
  • User-agent: GoogleOther
  • Block pages with no unique content
  • Disallow: /surftalk/staff/
  • Block faceted pages and 301 redirect pages
  • Block profile pages as these have little unique value, consume a lot of crawl time and contain hundreds of 301 links
  • Disallow: /surftalk/profile/
  • Sitemap URL