nxtwave.tech
robots.txt

Robots Exclusion Standard data for nxtwave.tech

Resource Scan

Scan Details

Site Domain nxtwave.tech
Base Domain nxtwave.tech
Scan Status Ok
Last Scan 2025-12-06T09:44:57+00:00
Next Scan 2026-01-05T09:44:57+00:00

Last Scan

Scanned 2025-12-06T09:44:57+00:00
URL https://nxtwave.tech/robots.txt
Redirect https://www.ccbp.in/robots.txt
Redirect Domain www.ccbp.in
Redirect Base ccbp.in
Domain IPs 108.156.144.100, 108.156.144.110, 108.156.144.29, 108.156.144.86
Redirect IPs 151.101.1.242, 151.101.129.242, 151.101.193.242, 151.101.65.242, 2a04:4e42:200::498, 2a04:4e42:400::498, 2a04:4e42:600::498, 2a04:4e42::498
Response IP 199.232.113.242
Found Yes
Hash e44f6ddf44816c14c4d0adc3d99991caa50466d3ef610be86ab8a68a9c789b2a
SimHash 74a781c1c5e4

Groups

*

Rule Path
Allow /*
Allow /reviews
Allow /academy-projects
Disallow /reviews*?
Disallow /academy-projects*?
Disallow /learning-reports/weekly-report*?
Disallow /unauthorized*
Disallow /blog/ar/*
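
The rules in the `*` group overlap (for example, `Allow /reviews` and `Disallow /reviews*?`), so the outcome for a given URL depends on how a crawler resolves conflicts. The following is a minimal sketch, assuming RFC 9309 semantics: `*` matches any run of characters, a trailing `$` anchors the end, `?` is a literal character, the longest matching pattern wins, and Allow wins a tie. The names `RULES`, `pattern_to_regex`, and `is_allowed` are hypothetical, and pattern length is counted in characters here, which is a simplification.

```python
import re

# Rules copied from the "*" group above, as (directive, path pattern) pairs.
RULES = [
    ("allow", "/*"),
    ("allow", "/reviews"),
    ("allow", "/academy-projects"),
    ("disallow", "/reviews*?"),
    ("disallow", "/academy-projects*?"),
    ("disallow", "/learning-reports/weekly-report*?"),
    ("disallow", "/unauthorized*"),
    ("disallow", "/blog/ar/*"),
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    Only "*" (any characters) and a trailing "$" (end of path) are special;
    everything else, including "?", is matched literally.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(path, rules=RULES):
    """Return True if `path` may be crawled under the assumed semantics."""
    best_len = -1
    best_allow = True  # no matching rule means the path is allowed
    for directive, pattern in rules:
        if pattern_to_regex(pattern).match(path):
            allow = directive == "allow"
            longer = len(pattern) > best_len
            tie_won_by_allow = len(pattern) == best_len and allow
            if longer or tie_won_by_allow:
                best_len, best_allow = len(pattern), allow
    return best_allow


if __name__ == "__main__":
    for path in ("/reviews", "/reviews?page=2", "/blog/ar/post", "/blog/en/post"):
        print(path, "->", "allowed" if is_allowed(path) else "disallowed")
```

Under these assumptions, `/reviews` stays crawlable while any `/reviews` URL carrying a query string is blocked, because `/reviews*?` is a longer pattern than `/reviews`.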

ai2bot
ai2bot-dolma
amazonbot
anthropic-ai
applebot
applebot-extended
brightbot 1.0
bytespider
ccbot
chatgpt-user
claude-web
claudebot
cohere-ai
cohere-training-data-crawler
crawlspace
diffbot
duckassistbot
facebookbot
friendlycrawler
google-extended
googleother
googleother-image
googleother-video
gptbot
iaskspider/2.0
icc-crawler
imagesiftbot
img2dataset
isscyberriskcrawler
kangaroo bot
meta-externalagent
meta-externalfetcher
oai-searchbot
omgili
omgilibot
pangubot
perplexitybot
petalbot
scrapy
semrushbot-ocob
semrushbot-swa
sidetrade indexer bot
timpibot
velenpublicwebcrawler
webzio-extended
youbot

Rule Path
Disallow /privacy-policy
Disallow /cookie-policy
Disallow /academy-digital-partner-tc
Disallow /academy-invite-earn-tc
Disallow /academy-4-0-inspire-mentorship-terms-and-conditions
Disallow /academy-partner-program-tc
Disallow /intensive-alumni-terms-and-conditions
Disallow /intensive-invite-earn-tc
Disallow /hiring-partner-terms-and-conditions
Disallow /ta-streak-challenge-terms-and-conditions
Disallow /ta-streak-challenge-guide
Disallow /invite-and-earn
Disallow /corporate-information
Disallow /terms-and-conditions
Disallow /grievance-redressal
Disallow /vision-and-values
Disallow /4-0-champions
Disallow /tech-community
Disallow /net
Disallow /intensive/referral
Disallow /intensive-english
Disallow /intensive-telugu
Disallow /abroad
Disallow /pap-bootcamp
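
The second group applies only to the user agents named above it. As a rough sketch, assuming RFC 9309 group selection (case-insensitive match on the product token, falling back to `*` when no named group matches), the lookup and the prefix check could look like this. The sets below are deliberately small subsets of the listed agents and paths, and the function names and the `"ai-bots"` label are hypothetical.

```python
# Illustrative subset of the user agents named in the second group above.
AI_GROUP_AGENTS = {"gptbot", "claudebot", "ccbot", "perplexitybot", "bytespider"}

# Illustrative subset of that group's Disallow paths.
AI_GROUP_DISALLOW = [
    "/privacy-policy",
    "/cookie-policy",
    "/terms-and-conditions",
    "/net",
]


def select_group(product_token):
    """Pick the group label for a crawler's product token.

    Matching is case-insensitive; unknown tokens fall back to "*".
    """
    return "ai-bots" if product_token.lower() in AI_GROUP_AGENTS else "*"


def disallowed_in_ai_group(path):
    """Return True if `path` starts with any Disallow prefix in the subset."""
    return any(path.startswith(prefix) for prefix in AI_GROUP_DISALLOW)


if __name__ == "__main__":
    for agent in ("GPTBot", "Googlebot"):
        print(agent, "->", select_group(agent))
    for path in ("/privacy-policy", "/network", "/courses"):
        print(path, "->", disallowed_in_ai_group(path))
```

Because these rules have no wildcards or end anchors, each one acts as a plain prefix: in this sketch `/net` also covers a path such as `/network`. Under the same RFC 9309 reading, a crawler that matches this named group would follow only this group's rules rather than those of the `*` group.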

Other Records

Field Value
sitemap https://www.ccbp.in/sitemap.xml

Comments

  • Disallowed AI Bots to crawl the TnC Pages