Quiz: Security, Privacy, and Vector Search¶

Test your understanding of the graph security model, row-level security, attribute-based access control, zero-trust architecture, federated learning, model audit trails, vector databases, HNSW, product quantization, embedding models, dense retrieval, and the context-graph ROI model.

1. The chapter defines the Graph Security Model as having four pillars. Which set names them correctly?¶

Authentication, authorization, data classification, and audit logging
Encryption, hashing, signing, and salting
Read, write, update, delete
Firewall, IDS, IPS, VPN

Show Answer

The correct answer is A. The chapter names these four components explicitly. The other options are unrelated security or system concepts.

Concept Tested: Graph Security Model

2. Row-level security in a graph offers three modes when a traversal would reach a restricted node. Which set names them and matches the chapter's described trade-offs?¶

Block the traversal (most secure, can break query logic), redacted result (placeholder reveals node exists but content hidden), and return no result (most restrictive — hides node existence but can produce misleading semantics)
Allow all, deny all, prompt user
Read, write, execute
Encrypt, decrypt, sign

Show Answer

The correct answer is A. The chapter names exactly these three modes with the trade-offs the question describes. The other options are unrelated.

Concept Tested: Row-Level Security in Graph

3. Attribute-based access control evaluates which four categories of attributes when making an access decision?¶

Cost, latency, bandwidth, throughput
Subject (user) attributes, resource (graph node/edge) attributes, action (read/write/traverse) attributes, and environment (time, network, device posture) attributes
Source IP, destination IP, port, protocol
Color, size, shape, weight

Show Answer

The correct answer is B. The chapter names exactly these four categories that make ABAC more expressive than role-based access control. The other options are unrelated.

Concept Tested: Attribute-Based Access Control

Because least-privilege agent permissions limit the blast radius if an agent is compromised or manipulated by a prompt injection — a financial-exception analysis agent should not be able to read HR decision traces even when the deploying human has access to both
Because LLMs cannot handle large permission sets
Because regulators always require zero-permission agents
Because agents bill by the permission

Show Answer

The correct answer is A. The chapter argues exactly this least-privilege rationale for agents. The other options misstate the reason.

Concept Tested: Zero-Trust Graph Architecture

Federated learning — each participating data source trains a local model on local data and shares only model parameters (not raw data) with a central aggregator, so a unified model benefits from all data without any participant's data leaving its jurisdiction
Manually copy data to a third country and train there
Skip model training entirely
Centralize the data despite the law

Show Answer

The correct answer is A. The chapter prescribes federated learning specifically for cross-jurisdiction training. The other options violate the law or abandon the model.

Concept Tested: Federated Learning

6. A compliance officer needs to reconstruct exactly which model version served a specific decision on a specific date. Which component of the AI architecture supplies this evidence?¶

The HNSW index parameters
The vector database
The differential privacy budget
The model audit trail — training data provenance, evaluation metrics, model versioning (identified by cryptographic hash), and approval records — together let auditors reconstruct exactly which model was serving on a given date and how it was approved

Show Answer

The correct answer is D. The chapter prescribes the model audit trail as the source of this evidence. The other options serve different purposes.

Concept Tested: Model Audit Trail

7. Why does the chapter argue that exact nearest neighbor search is impractical for production context graph retrieval, motivating algorithms like HNSW?¶

Because exact NN search requires comparing the query vector to every stored vector — scaling linearly with collection size. For ten million 768-dimensional embeddings this is roughly 15 billion floating-point multiplications per query, which takes seconds at modern hardware speeds and is too slow for interactive decision support
Because exact NN search is forbidden by GDPR
Because LLMs cannot read exact NN results
Because exact NN search returns the wrong answers

Show Answer

The correct answer is A. The chapter computes exactly this cost to motivate ANN algorithms. The other options misstate the rationale.

Concept Tested: Approximate Nearest Neighbor

8. Product quantization reduces embedding memory consumption by 24-48x. Which production pattern does the chapter prescribe to manage the small accuracy loss it introduces?¶

Abandon product quantization entirely
Use PQ for candidate retrieval (fetch more candidates than needed at low cost), then re-rank the candidates using the original full-precision embeddings (at higher cost but on a much smaller set) — the standard two-stage production pattern
Train a completely new model from scratch
Increase the embedding dimensionality

Show Answer

The correct answer is B. The chapter prescribes the two-stage PQ-then-rerank pattern. The other options either give up the memory savings (A) or ignore the issue (C, D).

Concept Tested: Product Quantization

9. The chapter argues that domain-adapted sentence transformers outperform general-purpose embedding models for context graph retrieval. What positive feedback loop does this create?¶

There is no feedback loop
Adoption of the system reduces its training data
Users marking retrieved decision traces as "very relevant" or "not relevant" provide positive/negative pairs that can fine-tune the sentence transformer — so the more the system is used, the more feedback is collected, and the better the embedding model becomes over time
The embedding model is replaced weekly

Show Answer

The correct answer is C. The chapter describes this virtuous cycle explicitly. The other options ignore or invert the cycle.

Concept Tested: Sentence Transformers

10. A startup is presenting a context graph ROI model to skeptical executives. According to the chapter, what is the most common footgun to avoid?¶

Quoting too low a price
Double-counting — attributing the same time savings to both decision speed improvement and compliance cost reduction, or crediting the context graph for benefits actually produced by concurrent investments (a CRM upgrade, a new hire, a process change); an ROI model with double-counted benefits fails scrutiny when a skeptical stakeholder asks to walk through the methodology
Using too many decimal places
Reporting in the wrong currency

Show Answer

The correct answer is B. The chapter calls double-counting the most common ROI footgun and prescribes mapping each benefit to a distinct, separately-measurable metric. The other options are presentation details, not the structural failure mode the chapter warns about.

Concept Tested: Context Graph ROI Model

Quiz: Security, Privacy, and Vector Search¶

1. The chapter defines the Graph Security Model as having four pillars. Which set names them correctly?¶

2. Row-level security in a graph offers three modes when a traversal would reach a restricted node. Which set names them and matches the chapter's described trade-offs?¶

3. Attribute-based access control evaluates which four categories of attributes when making an access decision?¶

4. In a zero-trust graph architecture, why does the chapter recommend that AI agents be granted only the minimum permissions required for their task — even narrower than the human user who deployed them?¶

5. A multinational deploying a context graph across the EU and Asia must respect data residency laws that prohibit centralizing decision traces. Which technique does the chapter recommend for training a shared retrieval model without violating residency requirements?¶

6. A compliance officer needs to reconstruct exactly which model version served a specific decision on a specific date. Which component of the AI architecture supplies this evidence?¶

7. Why does the chapter argue that exact nearest neighbor search is impractical for production context graph retrieval, motivating algorithms like HNSW?¶

8. Product quantization reduces embedding memory consumption by 24-48x. Which production pattern does the chapter prescribe to manage the small accuracy loss it introduces?¶

9. The chapter argues that domain-adapted sentence transformers outperform general-purpose embedding models for context graph retrieval. What positive feedback loop does this create?¶

10. A startup is presenting a context graph ROI model to skeptical executives. According to the chapter, what is the most common footgun to avoid?¶