Jasper Alblas
Jasper Alblas
Mastering Data & Cybersec
Before you invest months learning a new skill, you deserve an honest answer to the question every newcomer is actually thinking: is this worth it?
Not the marketing version. The real version — what the job pays, what the market looks like, how long it genuinely takes to get hired, what kind of person thrives in this role, and what the alternatives are if data engineering turns out not to be the right fit.
That’s what this article is. No hype, no vague promises. By the end, you’ll know whether data engineering is the right path for you, or whether a related role — data analyst or analytics engineer — might be a better match.
Data engineers build the infrastructure that makes data useful. They write pipelines that move data from source systems — databases, APIs, sensors, event streams — into storage systems where analysts, data scientists, and business intelligence tools can use it.
A realistic day looks something like this: debugging a pipeline that silently failed overnight, reviewing a pull request from a colleague adding a new data source, writing SQL to build a new dimension table for the analytics team, and joining a meeting with a data analyst to understand what format they need a new dataset in.
It is a technical, collaborative, infrastructure-focused role. If you enjoy building systems, solving “why is this broken?” problems, and seeing your work enable other people to do their jobs better — this is a deeply satisfying career. If you want to be the person presenting insights and recommendations to the business, you likely want to be a data analyst instead.
Data engineering is one of the fastest-growing roles in tech. The US Bureau of Labor Statistics projects roles in this field to grow around 8% over the coming decade — significantly faster than the average occupation. Demand spans every sector: finance, healthcare, e-commerce, logistics, media, manufacturing, and more.
Critically, the AI boom has increased demand for data engineers rather than threatening it. Every AI model, every LLM-powered product, every machine learning pipeline requires clean, reliable data infrastructure underneath. The more AI a company adopts, the more data engineers it needs.
Remote and hybrid work is widespread in this field. Data engineering is a well-suited remote role — the work is code-based, asynchronous-friendly, and doesn’t require physical presence. Many European companies hire across borders.
Salaries vary significantly by country, company size, and seniority. Here are realistic ranges for Europe based on current market data:
| Level | Experience | Salary Range |
|---|---|---|
| Junior | 0–2 years | €44,000 – €65,000 |
| Mid-level | 3–6 years | €65,000 – €90,000 |
| Senior | 7+ years | €90,000 – €120,000+ |
In Germany specifically, junior engineers typically earn between €44,000 and €54,000, mid-level engineers between €65,000 and €85,000. According to Hays, even as a beginner you are already among the higher earners in the IT profession, with starting salaries between €44,700 and €54,300 per year.
Switzerland is an outlier on the high end — the median data engineer salary there is around CHF 111,250 (approximately €115,000) per year.
In the United States, junior data engineers typically earn $90,000–$120,000, mid-level $120,000–$160,000, and senior engineers $160,000–$220,000+.
These figures are base salary only. Most mid-to-senior roles include bonuses and, at larger tech companies, significant stock compensation on top.
Data engineering tends to pay more than data analysis at equivalent experience levels, and roughly the same as analytics engineering. Software engineering is comparable, though senior software engineers at large tech companies often earn more.
Honestly: 6–12 months of consistent, focused study and project work is a realistic timeline for someone starting from a basic programming background. Less if you’re coming from software engineering or analytics. More if you’re starting completely from scratch.
The single biggest factor is whether you have something real to show. A well-documented GitHub project demonstrating a working pipeline — ingestion, transformation, modeling, orchestration — is worth more in most hiring processes than certifications alone.
This is exactly why this series is built around a single project, ShopFlow Analytics, that you develop from Part 2 through the capstone. By Part 7 of this series you have a production-grade, fully orchestrated pipeline with a Power BI dashboard and a documented GitHub repo. That’s a portfolio piece.
A computer science degree helps but is not required. What most employers actually care about is demonstrable skill — can you write Python, can you model data in SQL, do you understand how a pipeline works, do you use version control?
Most employers prefer a Master’s degree for this role, which means Bachelor’s graduates can generally expect a slightly lower starting salary. That said, career changers without relevant degrees are hired regularly when they have strong project portfolios and can pass a technical interview.
The skills gap in data engineering is large enough that companies are increasingly willing to look beyond traditional credentials.
If you’re deciding between paths, here’s a clear breakdown:
A data analyst uses existing data to answer business questions — writing SQL, building dashboards, presenting insights. A data engineer builds the infrastructure that makes the analyst’s data available and reliable.
Analysts need less programming depth but more business acumen and communication skills. Engineering needs deeper programming skills and systems thinking. Analyst salaries are typically lower at equivalent experience levels. If you like the consumption of data more than the construction of it, start with analytics.
Analytics engineering is a newer, hybrid role that sits between the two. Analytics engineers focus on transforming data inside the warehouse using tools like dbt — writing clean, tested SQL models that analysts query directly. It’s less infrastructure-heavy than data engineering and more code-heavy than analysis.
If you enjoy SQL and want to work close to the business without managing pipelines and cloud infrastructure, analytics engineering is worth considering. Many data engineers started as analytics engineers and moved up.
The skills overlap significantly — both roles write production code, use version control, test their work, and think about systems. The difference is the domain: data engineers specialise in data infrastructure, while software engineers build user-facing applications and services.
If you already work in software engineering, transitioning to data engineering is one of the most natural moves in tech. Your programming fundamentals transfer directly.
Based on the nature of the role, data engineers tend to do well when they:
Enjoy debugging. A significant portion of the job is figuring out why something broke, why a number looks wrong, or why a pipeline is slower than it should be. If you find this satisfying rather than frustrating, that’s a good sign.
Think in systems. Data engineering requires holding multiple layers of a system in your head simultaneously — where the data comes from, how it’s transformed, where it lands, who uses it, and what breaks if any layer changes.
Are comfortable with ambiguity. Requirements from stakeholders are often incomplete. “Can you get me the sales data by region?” hides a dozen decisions about granularity, history, and definition that you’ll need to uncover.
Like clean, well-structured work. The best data engineers care about code quality, documentation, and reproducibility. Messy pipelines are a debt that compounds fast.
Are collaborative without needing the spotlight. Data engineers typically work behind the scenes, enabling other people’s output. The recognition is indirect — a dashboard works because the pipeline works. If you need direct credit for business outcomes, analytics or science might suit you better.
No career guide is complete without them.
On-call is real. Pipelines fail. They fail at night, on weekends, during holidays. At most companies with production data pipelines, someone is on rotation. It’s manageable with good tooling and runbooks — which is exactly why Part 7 of this series covers observability and alerting — but it’s part of the job.
The tooling landscape moves fast. The modern data stack changes significantly every 2–3 years. Tools that were standard in 2020 are being replaced today. Continuous learning isn’t optional — it’s the job.
Junior roles are competitive. Despite strong demand at mid and senior levels, junior data engineering positions attract many applicants. A solid portfolio and clear interview preparation matter more than ever.
It can feel invisible. When your pipelines run perfectly, nobody notices. When they break, everyone does. The nature of infrastructure work is that success is invisible and failure is loud.
If you want to build things, enjoy solving technical problems, are comfortable with the learning curve, and want a well-paying career with strong long-term prospects — yes.
The Junior Track of this series (Parts 1–7) gives you everything needed to land a first role: Python, SQL, dimensional modeling, Power BI, dbt, Docker, Airflow, and a production-grade portfolio project. It’s designed for someone starting from a basic programming background, with no prior data engineering experience required.
If you’re unsure whether data engineering is right for you, the best thing you can do is start Part 2 and build the first version of the ShopFlow pipeline. An hour of hands-on work will tell you more about fit than any career guide.
Is data engineering still a good career with AI taking over? More stable than most. Every AI system requires reliable data pipelines to train and operate. Companies investing in AI are simultaneously investing heavily in data infrastructure. Data engineering roles show high demand with stable jobs even during layoffs, as core data teams tend to be protected.
Do I need to know machine learning to be a data engineer? No. Data engineers support machine learning teams but are not responsible for building models. You need to understand the interface — what a feature store is, how training data pipelines work — but you don’t need to be a data scientist. This series covers the ML handoff in Part 9.
What programming languages do I need? Python and SQL are the two non-negotiables. Python for pipeline logic, SQL for querying and modeling. This series teaches both, starting in Part 2 (Python) and Part 3 (SQL).
Can I become a data engineer without a maths background? Yes. Data engineering is much closer to software engineering than to statistics or machine learning. Heavy mathematics is not required. Logic, systems thinking, and programming are the core skills.
How is the work-life balance? Generally good at most companies. Better than software engineering roles that ship consumer products with aggressive release cycles. The on-call element can disrupt evenings occasionally, but mature data teams invest in good monitoring and runbooks to minimise this.
Next up → 1.2: Fundamentals: The World of Data — the mental model that every tool, concept, and technique in this series plugs into.