Logs/Traces Sensitive Data Obfuscation
groundcover’s pipelines can be used to protect sensitive data in your Logs and Traces using Vector's redact function. Mask or remove sensitive information while preserving the usefulness of your data.
groundcover’s pipelines can be used to protect sensitive data in your Logs and Traces. With Vector's redact
function, you can mask or remove sensitive information while preserving the usefulness of your observability data.
We highly recommend using Vector's built-in function redact
for Logs/Traces obfuscation. This powerful function allows you to configure simple yet effective redaction rules to protect sensitive information in your logs and traces.
With redact
, you can:
Mask or remove sensitive data from strings, arrays, or objects
Replace text matching specified patterns (like regex) with a placeholder, custom text, or a hash (SHA-2 or SHA-3)
Please refer to the redact
function's documentation for more details.
On this page, we'll explore how to leverage the redact
function and VRL's capabilities to obfuscate PII in Logs and Traces. At the end of this page, you'll find a handy list of regex patterns to save you time and effort.
Trace obfuscation can also be configured directly in the sensor. Can't find what you're looking for? Let us know over Slack.
Examples
In the examples below, we redact both the log contents (.content
) and any attributes derived from the structued logs (.string_attributes
).
Obfuscate credit card numbers from Logs
In this example, we'll obfuscate Visa credit card numbers from logs using the Visa credit card regex pattern from the library. By not specifying a redactor type, the redact
function will default to full redaction, replacing detected numbers with the string “[REDACTED].”
Here's an example of how Logs appear before and after obfuscation:
Hash US SSNs in Logs
In this example we’ll hash of all US Social Security Numbers hidden in logs. We’ll pass the sha2
parameter to the redactor
to hash the sensitive values.
Here's how logs appear before and after obfuscation:
Obfuscate IPs with Two Stages from Logs This example demonstrates how to obfuscate IP addresses in logs using a two-stage approach:
Here's an example of how logs appear before and after this obfuscation:
Patterns Library
Credit Card Scanners
Description | Pattern |
---|---|
Maestro Card (16 digits) |
|
Discover Card (16 digits) |
|
Diners Club (14 digits) |
|
American Express (15 digits) |
|
JCB Card (16 digits) |
|
MasterCard (16 digits) |
|
Visa Card (16 or 19 digits) |
|
API Key and Token Scanners
Description | Pattern |
---|---|
AWS Access Key ID and Secret Access Key |
|
Google API Key and OAuth Access Token |
|
Mailchimp API Key |
|
Social Media Tokens (Facebook, Slack, Twitter, Instagram, LinkedIn) |
|
Azure Personal Access Token |
|
Azure SQL Connection String |
|
Azure Subscription Key |
|
GitHub Access Token and Refresh Token |
|
Shopify Access Token and Shared Secret |
|
Okta API Token |
|
JSON Web Token (JWT) |
|
RSA Private Key |
|
PGP Private Key |
|
GitLab Token |
|
Amazon Marketplace Web Services Auth Token |
|
Bearer Token |
|
JIRA API Token |
|
Other Scanners
Description | Pattern |
---|---|
Standard Email Address |
|
Standard IBAN Code |
|
Standard MAC Address |
|
IPv4 Address |
|
IPv6 Address |
|
HTTP(S) URL |
|
HTTP Basic Authentication Header |
|
HTTP Cookie |
|
US Passport Number |
|
US Vehicle Identification Number (VIN) |
|
UK National Insurance Number |
|
Canadian Social Insurance Number (SIN) |
|
US Social Security Number (SSN) |
|
Last updated