Referred Link - https://www.linkedin.com/posts/the-gen-academy_genacademy-genai-rag-activity-7374314296928899072-m3Z5
𝗧𝗵𝗶𝗻𝗸 𝗼𝗳 𝗥𝗔𝗚 𝗮𝘀 𝗴𝗶𝘃𝗶𝗻𝗴 𝘆𝗼𝘂𝗿 𝗔𝗜 𝗽𝗲𝗿𝗺𝗶𝘀𝘀𝗶𝗼𝗻 𝘁𝗼 “𝗼𝗽𝗲𝗻 𝗮 𝗯𝗼𝗼𝗸” 𝗯𝗲𝗳𝗼𝗿𝗲 𝗶𝘁 𝗮𝗻𝘀𝘄𝗲𝗿𝘀.
If you’ve bumped into Retrieval-Augmented Generation (RAG) and wondered what it really is (and when you actually need it), this mini-primer is for you.
𝗪𝗵𝗮𝘁 𝗥𝗔𝗚 𝗶𝘀 — 𝗶𝗻 𝗼𝗻𝗲 𝗯𝗿𝗲𝗮𝘁𝗵
RAG pairs a language model with an external knowledge source so answers are grounded in real, up-to-date information instead of just whatever the model remembers from training. That means fewer made-up facts and more verifiable responses.
𝗪𝗵𝗲𝗻 𝘆𝗼𝘂 𝘀𝗵𝗼𝘂𝗹𝗱 𝗿𝗲𝗮𝗰𝗵 𝗳𝗼𝗿 𝗥𝗔𝗚
✅You want a domain-specific assistant (HR policy bot, clinical FAQ, internal IT helper).
✅You need current info beyond a model’s training cutoff.
✅You care about citations and traceability.
𝗧𝗵𝗲 𝗽𝗶𝗽𝗲𝗹𝗶𝗻𝗲 (𝘀𝗶𝗺𝗽𝗹𝗲 𝘃𝗲𝗿𝘀𝗶𝗼𝗻)
✅𝗜𝗻𝗱𝗲𝘅𝗶𝗻𝗴 – Gather your sources (PDFs, sites, databases). Split long docs into smaller, meaningful “chunks,” turn each chunk into an embedding (a numeric vector), and store them in a vector database for fast similarity search.
✅𝗥𝗲𝘁𝗿𝗶𝗲𝘃𝗮𝗹 – Convert the user’s question into an embedding and fetch the closest chunks from the vector store.
✅𝗚𝗲𝗻𝗲𝗿𝗮𝘁𝗶𝗼𝗻 – Feed the question + retrieved chunks to the LLM to produce a grounded answer (and optionally add citations).
Why chunk? Models don’t magically use long context well; narrowing to the most relevant bits improves precision and keeps prompts lean.
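The three steps above can be sketched in a few lines of plain Python. This is a toy illustration: a bag-of-words counter stands in for a real embedding model, and a plain list stands in for a vector database. All names here are made up for the example.

```python
from collections import Counter
import math

def chunk(text, size=40):
    # Split text into fixed-size word chunks; real splitters respect
    # sentence and section boundaries.
    words = text.split()
    return [" ".join(words[i:i + size]) for i in range(0, len(words), size)]

def embed(text):
    # Toy embedding: word counts. A real system calls an embedding model
    # that returns a dense numeric vector.
    return Counter(text.lower().split())

def cosine(a, b):
    # Cosine similarity between two sparse count vectors.
    dot = sum(a[w] * b[w] for w in a)
    na = math.sqrt(sum(v * v for v in a.values()))
    nb = math.sqrt(sum(v * v for v in b.values()))
    return dot / (na * nb) if na and nb else 0.0

def retrieve(query, index, k=2):
    # Rank stored chunks by similarity to the query embedding.
    q = embed(query)
    scored = sorted(index, key=lambda c: cosine(q, c["vec"]), reverse=True)
    return [c["text"] for c in scored[:k]]

# 1. Indexing: chunk the sources and store (text, embedding) pairs.
docs = ["Employees accrue 20 vacation days per year. Unused days roll over.",
        "The VPN requires multi-factor authentication for all remote logins."]
index = [{"text": c, "vec": embed(c)} for d in docs for c in chunk(d)]

# 2. Retrieval: fetch the closest chunks for the user's question.
question = "How many vacation days do I get?"
context = retrieve(question, index, k=1)

# 3. Generation: the retrieved chunks go into the LLM prompt.
prompt = f"Answer using only this context:\n{context[0]}\n\nQuestion: {question}"
```

Swapping the toy `embed` for a real model and the list for a vector store keeps the same shape; the pipeline logic doesn’t change.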
𝗛𝗲𝗹𝗽𝗳𝘂𝗹 𝗮𝗱𝗱-𝗼𝗻𝘀 (𝘂𝘀𝗲 𝗮𝘀 𝗻𝗲𝗲𝗱𝗲𝗱)
✅𝗤𝘂𝗲𝗿𝘆 𝘁𝗿𝗮𝗻𝘀𝗹𝗮𝘁𝗶𝗼𝗻 (𝗛𝘆𝗗𝗘, 𝗺𝘂𝗹𝘁𝗶-𝗾𝘂𝗲𝗿𝘆): Rewrite or expand the question so retrieval finds better matches. HyDE, for instance, has the model draft a hypothetical answer, embed it, and search with that to boost recall.
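The multi-query idea reduces to: retrieve once per rephrasing, then merge and deduplicate. A minimal sketch, assuming a toy keyword retriever; in real multi-query (or HyDE) the rephrasings (or the hypothetical answer) would come from the model, not be hand-written as here.

```python
# Toy store: a doc is a "hit" if it shares any word with the query.
STORE = ["pto policy: 20 days", "vpn setup guide", "vacation carryover rules"]

def toy_retrieve(query, k=2):
    words = set(query.lower().split())
    hits = [d for d in STORE if words & set(d.lower().split())]
    return hits[:k]

def multi_query_retrieve(queries, k=2):
    # Run retrieval once per phrasing and merge, keeping first-seen order.
    seen, merged = set(), []
    for q in queries:
        for doc in toy_retrieve(q, k):
            if doc not in seen:
                seen.add(doc)
                merged.append(doc)
    return merged

# Hand-written rephrasings stand in for model-generated ones.
results = multi_query_retrieve(["vacation days", "pto days allowed"])
```

Each phrasing surfaces different chunks, so the merged set covers more of the store than any single query would.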
✅𝗥𝗼𝘂𝘁𝗶𝗻𝗴 & 𝗰𝗼𝗻𝘀𝘁𝗿𝘂𝗰𝘁𝗶𝗼𝗻: If you have multiple stores (policies, product docs, web search), route the query to the best source and add filters (e.g., “last 90 days”).
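A router can be as simple as a keyword match that picks a store and attaches a metadata filter. The store names and filter keys below are hypothetical; production routers often use an LLM classifier or embedding similarity instead of keywords.

```python
from datetime import date, timedelta

def route(query):
    # Pick a store by keyword and attach an optional metadata filter.
    q = query.lower()
    if "policy" in q or "leave" in q:
        return "policies", {}
    if "price" in q or "spec" in q:
        return "product_docs", {}
    # Fall back to web search, restricted to the last 90 days.
    return "web_search", {"published_after": date.today() - timedelta(days=90)}

store, filters = route("What is the parental leave policy?")
```

The filter dict travels with the query so the chosen store can narrow its search (by date, team, document type, and so on).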
𝗕𝘂𝗶𝗹𝗱𝗶𝗻𝗴 𝗯𝗹𝗼𝗰𝗸𝘀 (𝘄𝗶𝘁𝗵𝗼𝘂𝘁 𝘁𝗵𝗲 𝗵𝗲𝗮𝗱𝗮𝗰𝗵𝗲)
✅𝗟𝗮𝗻𝗴𝗖𝗵𝗮𝗶𝗻 (𝗰𝗵𝗮𝗶𝗻𝘀): Wire steps like “translate → retrieve → generate → parse” into a clear sequence you can swap and test.
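The chain idea, stripped to plain Python: each step is a small function, and the pipeline is explicit composition you can swap or test step by step. This mimics the spirit of LangChain’s `|` operator without depending on the library; the retriever and LLM here are hard-coded stand-ins.

```python
def translate(q):
    # e.g., rewrite or normalize the query (HyDE, expansion, cleanup).
    return q.strip().lower()

def retrieve(q):
    # Stand-in retriever; a real one queries a vector store.
    return {"question": q, "context": "Refunds are issued within 14 days."}

def generate(inputs):
    # Stand-in for the LLM call.
    return f"Q: {inputs['question']} | A (from context): {inputs['context']}"

def parse(answer):
    # e.g., extract just the answer text from the raw model output.
    return answer.split("A (from context): ")[1]

def chain(*steps):
    # Compose steps left-to-right, like LangChain's `|` operator.
    def run(x):
        for step in steps:
            x = step(x)
        return x
    return run

rag_chain = chain(translate, retrieve, generate, parse)
out = rag_chain("  When do I get my refund?  ")
```

Because each step is a plain function, you can unit-test `parse` alone or swap `retrieve` for a different store without touching the rest of the chain.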
✅𝗟𝗮𝗻𝗴𝗦𝗺𝗶𝘁𝗵 (𝗼𝗯𝘀𝗲𝗿𝘃𝗮𝗯𝗶𝗹𝗶𝘁𝘆): Trace every run, see timings and inputs/outputs, and debug failures—super handy once you go beyond demos.
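What tracing actually records can be shown with a tiny decorator: inputs, output, and latency per step. This is an illustration of the concept, not LangSmith’s API; LangSmith sends the same kind of records to a hosted dashboard instead of an in-memory list.

```python
import functools
import time

TRACES = []  # stand-in for a tracing backend

def traced(fn):
    # Record inputs, output, and latency for every call of the step.
    @functools.wraps(fn)
    def wrapper(*args, **kwargs):
        start = time.perf_counter()
        out = fn(*args, **kwargs)
        TRACES.append({
            "step": fn.__name__,
            "inputs": args,
            "output": out,
            "ms": (time.perf_counter() - start) * 1000,
        })
        return out
    return wrapper

@traced
def retrieve(query):
    # Stand-in retrieval step.
    return ["chunk about " + query]

retrieve("refund policy")
```

With every step wrapped like this, a slow or failing run shows exactly which stage misbehaved and with what inputs.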
𝗦𝘂𝗺𝗺𝗮𝗿𝘆 𝘆𝗼𝘂 𝗰𝗮𝗻 𝘁𝗮𝗸𝗲 𝘁𝗼 𝘄𝗼𝗿𝗸
✅Start simple: good chunking + a solid vector DB + a clear prompt template.
✅Measure what matters (accuracy on real tasks, not vibes).
✅Iterate: logs and traces will tell you where the bottleneck is.