peaceweb.net
robots.txt

Robots Exclusion Standard data for peaceweb.net

Resource Scan

Scan Details

Site Domain peaceweb.net
Base Domain peaceweb.net
Scan Status Ok
Last Scan2026-02-24T19:35:34+00:00
Next Scan 2026-03-10T19:35:34+00:00

Last Scan

Scanned2026-02-24T19:35:34+00:00
URL https://peaceweb.net/robots.txt
Redirect https://peaceweb.com/robots.txt
Redirect Domain peaceweb.com
Redirect Base peaceweb.com
Domain IPs 104.21.52.100, 172.67.198.16, 2606:4700:3036::ac43:c610, 2606:4700:3037::6815:3464
Redirect IPs 172.66.40.223, 172.66.43.33, 2606:4700:3108::ac42:28df, 2606:4700:3108::ac42:2b21
Response IP 172.66.43.33
Found Yes
Hash 551edf9da40fa1cb086dadcb75016b2ad79ff18ddecb39c0ffaa71255d76729b
SimHash 2f349b30a432

Groups

*

Rule Path
Disallow /llms.txt
Disallow /llms-full.txt
Disallow /admin
Disallow /api
Disallow /dashboard
Disallow /storage
Disallow /vendor
Disallow /.env
Disallow /.git
Disallow /login
Disallow /register
Disallow /password
Disallow /email
Disallow /cart
Disallow /checkout
Disallow /my
Disallow /settings
Disallow /*?*sort=
Disallow /*?*filter=
Disallow /*?*page=
Allow /css
Allow /js
Allow /img
Allow /build
Allow /sw-ipmarket.js
Allow /en/
Allow /nl/
Allow /de/
Allow /es/
Allow /fr/

Other Records

Field Value
crawl-delay 1

googlebot

Rule Path
Allow /
Disallow /admin
Disallow /api
Disallow /dashboard
Disallow /login
Disallow /register

Other Records

Field Value
crawl-delay 0

bingbot

Rule Path
Allow /
Disallow /admin
Disallow /api
Disallow /dashboard
Disallow /login
Disallow /register

Other Records

Field Value
crawl-delay 1

gptbot

Rule Path
Allow /
Disallow /admin
Disallow /api
Disallow /dashboard

chatgpt-user

Rule Path
Allow /
Disallow /admin
Disallow /api
Disallow /dashboard

claude-web

Rule Path
Allow /
Disallow /admin
Disallow /api
Disallow /dashboard

anthropic-ai

Rule Path
Allow /
Disallow /admin
Disallow /api
Disallow /dashboard

perplexitybot

Rule Path
Allow /
Disallow /admin
Disallow /api
Disallow /dashboard

google-extended

Rule Path
Allow /
Disallow /admin
Disallow /api
Disallow /dashboard

applebot-extended

Rule Path
Allow /
Disallow /admin
Disallow /api
Disallow /dashboard

ccbot

Rule Path
Allow /
Disallow /admin
Disallow /api
Disallow /dashboard

cohere-ai

Rule Path
Allow /
Disallow /admin
Disallow /api
Disallow /dashboard

ahrefsbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 10

semrushbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 10

mj12bot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 10

Other Records

Field Value
sitemap https://ipmarket.io/sitemap.xml

Comments

  • IP Market Robots.txt
  • Last updated: 2026-01-29
  • Disallow LLM-specific files from search engine indexing
  • Disallow sensitive application directories
  • Disallow search and filter pages with parameters
  • Allow public assets
  • Allow important marketing pages
  • Crawl delay to prevent aggressive crawling
  • Sitemaps
  • Google-specific rules
  • Bing-specific rules
  • AI Crawlers - Allow for better AI understanding
  • Block aggressive/unwanted bots