Databricks is a widely used platform for building data lakes. It supports Spark SQL, a dialect of Structured Query Language (SQL). If you want to learn how to use Spark SQL to analyze data in a data lake, this book is for you. It covers everything from basic queries to complex data-processing tasks.
The book begins with an introduction to SQL and Spark. It then covers the basics of SQL, including data types, operators, and clauses. The next few chapters focus on filtering, aggregation, and calculation. Later chapters cover dates and times, formatting output, and using logic in your queries, followed by joining tables, subqueries, derived tables, and common table expressions.
Format | Paperback |
Length | 556 pages |
Language | English |
Publisher | BPB Publications |
Publication date | 2023-11-06 |
ISBN | 9789355518019 |