petergreenaway.info
robots.txt

Robots Exclusion Standard data for petergreenaway.info

Resource Scan

Scan Details

Site Domain petergreenaway.info
Base Domain petergreenaway.info
Scan Status Ok
Last Scan 2025-10-29T22:46:28+00:00
Next Scan 2025-11-05T22:46:28+00:00

Last Scan

Scanned 2025-10-29T22:46:28+00:00
URL https://petergreenaway.info/robots.txt
Redirect https://owlcti.za.com/robots.txt
Redirect Domain owlcti.za.com
Redirect Base owlcti.za.com
Domain IPs 104.21.52.63, 172.67.196.52, 2606:4700:3037::6815:343f, 2606:4700:3037::ac43:c434
Redirect IPs 104.21.86.168, 172.67.222.100, 2606:4700:3032::ac43:de64, 2606:4700:3034::6815:56a8
Response IP 104.21.86.168
Found Yes
Hash c219bf5e1bfdeced35016cded1f0795c7b3b825046ed3b3a3f8d739d8629a482
SimHash c6354953cd95

Groups

*

Rule Path
Allow /

amazonbot

Rule Path
Disallow /

applebot-extended

Rule Path
Disallow /

bytespider

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

claudebot

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

meta-externalagent

Rule Path
Disallow /

teleport
teleportpro
emailcollector
emailsiphon
webbandit
webzip
webreaper
webstripper
web downloader
ahrefsbot
semrushbot
mj12bot
webcopier
offline explorer pro
offline explorer
httrack website copier
offline commander
leech
websnake
blackwidow
http weazel

Rule Path
Disallow /

*

Rule Path
Disallow /video/*
Disallow /admin/
Disallow /dieu-khoan.html
Disallow /lien-he.html
Disallow /api/*
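
The file contains two separate `*` groups, so how a crawler resolves them matters. The sketch below is a minimal illustration, assuming RFC 9309 semantics (groups for the same user agent are merged, the longest matching pattern wins, and Allow wins a tie). The `GROUPS` list is a hand-copied subset of the groups above, and the names `rules_for` and `is_allowed` are hypothetical helpers, not part of any library. User-agent matching is simplified to an exact lowercase comparison.

```python
import re

# Subset of the groups listed in this report (hand-copied, not fetched).
GROUPS = [
    (["*"], [("allow", "/")]),
    (["gptbot"], [("disallow", "/")]),
    (["claudebot"], [("disallow", "/")]),
    (["ahrefsbot", "semrushbot", "mj12bot"], [("disallow", "/")]),
    (["*"], [
        ("disallow", "/video/*"),
        ("disallow", "/admin/"),
        ("disallow", "/dieu-khoan.html"),
        ("disallow", "/lien-he.html"),
        ("disallow", "/api/*"),
    ]),
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern: '*' is a wildcard, trailing '$' anchors the end."""
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def rules_for(agent):
    """Merge all groups naming the agent; fall back to the merged '*' groups."""
    agent = agent.lower()
    specific = [rule for agents, rules in GROUPS if agent in agents for rule in rules]
    if specific:
        return specific
    return [rule for agents, rules in GROUPS if "*" in agents for rule in rules]


def is_allowed(agent, path):
    """Longest matching pattern wins; on equal length, Allow beats Disallow."""
    best = (-1, True)  # no matching rule means the path is allowed
    for kind, pattern in rules_for(agent):
        if pattern_to_regex(pattern).match(path):
            best = max(best, (len(pattern), kind == "allow"))
    return best[1]
```

Under these assumptions, `is_allowed("gptbot", "/")` is False, while an unlisted crawler may fetch `/` but not `/admin/` or anything under `/video/`. Parsers that keep only the first `*` group, or that apply the first matching rule instead of the longest, may reach a different result for the paths in the second `*` group.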

Other Records

Field Value
sitemap https://owlcti.za.com/abcccc-sitemap.xml

Comments

  • As a condition of accessing this website, you agree to abide by the following content signals:
  • (a) If a content-signal = yes, you may collect content for the corresponding use.
  • (b) If a content-signal = no, you may not collect content for the corresponding use.
  • (c) If the website operator does not include a content signal for a corresponding use, the website operator neither grants nor restricts permission via content signal with respect to the corresponding use.
  • The content signals and their meanings are:
  • search: building a search index and providing search results (e.g., returning hyperlinks and short excerpts from your website's contents). Search does not include providing AI-generated search summaries.
  • ai-input: inputting content into one or more AI models (e.g., retrieval augmented generation, grounding, or other real-time taking of content for generative AI search answers).
  • ai-train: training or fine-tuning AI models.
  • ANY RESTRICTIONS EXPRESSED VIA CONTENT SIGNALS ARE EXPRESS RESERVATIONS OF RIGHTS UNDER ARTICLE 4 OF THE EUROPEAN UNION DIRECTIVE 2019/790 ON COPYRIGHT AND RELATED RIGHTS IN THE DIGITAL SINGLE MARKET.
  • BEGIN Cloudflare Managed content
  • END Cloudflare Managed Content
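
The yes/no/absent semantics in the comments above can be sketched in a few lines. This is an illustration only: the report does not show the actual `content-signal` value for this site, so the example line used below is hypothetical, and the comma-separated `key=value` syntax is an assumption about how the field is written. The function names are likewise made up for this sketch.

```python
def parse_content_signal(line):
    """Parse a line such as 'Content-Signal: search=yes, ai-train=no' into a dict.

    Values map to True (yes) or False (no); malformed items are skipped.
    """
    field, _, value = line.partition(":")
    if field.strip().lower() != "content-signal":
        raise ValueError("not a content-signal line")
    signals = {}
    for item in value.split(","):
        key, sep, val = item.partition("=")
        key, val = key.strip().lower(), val.strip().lower()
        if sep and key and val in ("yes", "no"):
            signals[key] = (val == "yes")
    return signals


def may_collect(signals, use):
    """Return True (granted), False (restricted), or None (no signal given, rule (c))."""
    return signals.get(use)


# Hypothetical value; the real line is not included in this report.
example = parse_content_signal("Content-Signal: search=yes, ai-train=no")
```

Here `may_collect(example, "ai-input")` returns None, which corresponds to case (c): the operator neither grants nor restricts that use via content signal.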

Warnings

  • `content-signal` is not a known field.