Guardrails

SynapseKit provides guardrails for content safety, PII detection, and topic restriction. Guardrails can be composed into a pipeline and applied to agent inputs, outputs, or both.

ContentFilter

Block harmful or inappropriate content:

from synapsekit import ContentFilter

filter = ContentFilter()

result = filter.check("How do I make a cake?")
# result.passed → True

result = filter.check("How do I hack into a server?")
# result.passed → False
# result.reason → "Content blocked: potentially harmful request"

PIIDetector

Detect and optionally redact personally identifiable information:

from synapsekit import PIIDetector

detector = PIIDetector()

result = detector.check("My email is alice@example.com and my SSN is 123-45-6789")
# result.passed → False
# result.pii_found → ["email", "ssn"]

# Redact PII
redacted = detector.redact("Call me at 555-0123, my email is alice@example.com")
# redacted → "Call me at [PHONE], my email is [EMAIL]"

TopicRestrictor

Restrict agent conversations to allowed topics:

from synapsekit import TopicRestrictor

restrictor = TopicRestrictor(
    allowed_topics=["python", "programming", "software engineering"],
    llm=llm,
)

result = await restrictor.check("How do I write a Python decorator?")
# result.passed → True

result = await restrictor.check("What is the best pizza in New York?")
# result.passed → False
# result.reason → "Topic not in allowed list"

Guardrails (composed pipeline)

Compose multiple guardrail checks into a single pipeline:

from synapsekit import Guardrails, ContentFilter, PIIDetector, TopicRestrictor

guardrails = Guardrails(
    checks=[
        ContentFilter(),
        PIIDetector(),
        TopicRestrictor(allowed_topics=["customer support"], llm=llm),
    ],
)

# Check input before sending to agent
result = await guardrails.check("My SSN is 123-45-6789, help me with my order")
if not result.passed:
    print(f"Blocked: {result.reason}")  # "PII detected: ssn"

PIIRedactor

Advanced PII handling with reversible masking and LLM integration. Builds on PIIDetector with numbered placeholders, same-value deduplication, and a wrap_generate() helper.

from synapsekit import PIIRedactor

redactor = PIIRedactor(pii_types=["email", "phone"], mode="mask")

# Redact PII with numbered placeholders
result = redactor.redact("Email alice@example.com or call 555-123-4567")
print(result.redacted_text)    # "Email [EMAIL_1] or call [PHONE_1]"
print(result.mapping)          # {"[EMAIL_1]": "alice@example.com", "[PHONE_1]": "555-123-4567"}
print(result.pii_types_found)  # ["email", "phone"]

# Restore original values
restored = redactor.restore(result.redacted_text, result.mapping)
print(restored)  # "Email alice@example.com or call 555-123-4567"

Modes

Mode	Description	Mapping
`"mask"` (default)	Reversible — placeholders can be restored	Yes
`"redact"`	Permanent — no mapping stored	No

Transparent LLM integration

Use wrap_generate() to automatically redact PII before sending to the LLM and restore it in the response:

from synapsekit import PIIRedactor

redactor = PIIRedactor(pii_types=["email"], mode="mask")

# The LLM never sees real PII
restored_response, redaction = await redactor.wrap_generate(
    llm, "My email is alice@example.com, confirm it"
)
# LLM receives: "My email is [EMAIL_1], confirm it"
# LLM responds: "Your email is [EMAIL_1]."
# restored_response: "Your email is alice@example.com."

Same-value deduplication

Identical PII values always get the same placeholder:

result = redactor.redact("Contact alice@example.com. Reply to alice@example.com.")
# "[EMAIL_1]" appears twice, mapping has only one entry

Using with agents

from synapsekit import FunctionCallingAgent, Guardrails, ContentFilter, PIIDetector

guardrails = Guardrails(checks=[ContentFilter(), PIIDetector()])

agent = FunctionCallingAgent(llm=llm, tools=tools)

# Check input
input_check = await guardrails.check(user_input)
if not input_check.passed:
    return f"Sorry, I can't process that: {input_check.reason}"

result = await agent.run(user_input)

# Check output
output_check = await guardrails.check(result)
if not output_check.passed:
    return "Sorry, I can't share that response."

return result

ContentFilter​

PIIDetector​

TopicRestrictor​

Guardrails (composed pipeline)​

PIIRedactor​

Modes​

Transparent LLM integration​

Same-value deduplication​

Using with agents​