Data Engineering for Cybersecurity: Build Secure Data Pipelines with Free and Open-Source Tools

James Bonifield

Book cover for Data Engineering for Cybersecurity: Build Secure Data Pipelines with Free and Open-Source Tools
Book cover for Data Engineering for Cybersecurity: Build Secure Data Pipelines with Free and Open-Source Tools

Data Engineering for Cybersecurity: Build Secure Data Pipelines with Free and Open-Source Tools

Data Engineering for Cybersecurity: Build Secure Data Pipelines with Free and Open-Source Tools

James Bonifield

Member Benefits

  • 30% Off All Books - Savings that support storytellers, not stock prices.
  • Fight Book Bans - Every membership sends a book to LGBTQ+ youth in affected states.
Member Book Price
$49.99 $34.99
Non-Member Book Price $49.99

An annual membership will be billed at $48/year.

Discount applies to first-time members only. Already a member? Log in here.

View full details

Description

Turn raw logs into real intelligence.

Security teams rely on telemetry--the continuous stream of logs, events, metrics, and signals that reveal what's happening across systems, endpoints, and cloud services. But that data doesn't organize itself. It has to be collected, normalized, enriched, and secured before it becomes useful. That's where data engineering comes in.

In this hands-on guide, cybersecurity engineer James Bonifield teaches you how to design and build scalable, secure data pipelines using free, open source tools such as Filebeat, Logstash, Redis, Kafka, and Elasticsearch and more. You'll learn how to collect telemetry from Windows including Sysmon and PowerShell events, Linux files and syslog, and streaming data from network and security appliances. You'll then transform it into structured formats, secure it in transit, and automate your deployments using Ansible.

You'll also learn how to:
  • Encrypt and secure data in transit using TLS and SSH
  • Centrally manage code and configuration files using Git
  • Transform messy logs into structured events
  • Enrich data with threat intelligence using Redis and Memcached
  • Stream and centralize data at scale with Kafka
  • Automate with Ansible for repeatable deployments

Whether you're building a pipeline on a tight budget or deploying an enterprise-scale system, this book shows you how to centralize your security data, support real-time detection, and lay the groundwork for incident response and long-term forensics.

About the Author

James Bonifield has over a decade of experience analyzing malicious activity, implementing data pipelines, and training others in the security industry. He has built enterprise-scale log solutions, automated detection workflows, and led analyst teams investigating major cyber threat actors. Bonifield holds numerous certifications and enjoys spending time with his family, traveling, and tinkering with all things security and Python related.

Publishing Information

Publisher: No Starch Press
Pub date: 2025-08-26
Length: 344 pages

The Allstora Membership

Membership Perks:

  • Save 30% on all online store purchases
  • Exclusive access to author's content
  • You pay less, but authors still earn double

Membership Terms:

First Month: $0.00
Monthly price: $5.00
  • To access membership discount simply log in and add to cart, discount applied automatically.
  • One month free trial, cancel anytime. Membership renews on the 15th of each month.