Parquet – Performance Benchmark

If you are following today’s trend for building an efficient Modern Data Stack, all are of aware of the “parquet” format offering efficient data storage and retrieval. In addition to being an in-memory column-oriented storage format similar to ORC, it also provides features such as efficient data compression and encoding schemas, resulting in enhanced performance

Loading

Subject Area Bootstrapping in Azure Synapse Analytics

We all know about the AdventureWorks database from MS SQL Server, a replacement for Northwind and Pubs database. Just released recently, the Synapse Analytics team has introduced Templates for various subject areas around business areas. To name a few are Customer, Contract, Order, Party etc. The feature also has the flexibility of adding/removing entities and

Loading

Rental Demand Forecast Analysis using Python

Supply & Demand forecasting is a hot topic in the machine learning community.  The importance of operations management is widely understood for goods, commodities, and even in the service industry. Supply chain disruptions like machinery breakdown, quality concerns, inaccurate inventory records, poor forecasting, capacity, or labor shortages cause millions in losses.  Machine learning algorithms are being

Loading

SSMS Top N Rows

Whenever we work with Azure Synapse through SSMS, we want to quickly preview the top N data. A quick way of achieving it is right-click on the table / view and select but, sometimes we don’t need the 1000’s of rows. Maybe 100 or 10 is sufficient enough. To change the default 1000 rows count,

Loading