Thursday, October 13, 2022

File Formats in Big Data

 

Impact of Data File Formats in Big Data

  • Imagine you visit a grocery store where nothing is in order and related items are scattered across different shelves. Would this make your shopping experience better? I think the answer is no. In fact, you might never visit this grocery store again.
  • If you understood the example above, you can now imagine the impact unorganized data can have on a company.
  • Companies can receive anywhere from tens to thousands of GB of data every day. If this data is not stored in a proper format, understanding it will be difficult and sometimes impossible.
  • The more time you spend sorting through the data, the more opportunities the company misses to retain customers or generate more orders and revenue.

Use-case 1:

If you want the total sales from a table, the query mostly needs to scan a single column of that table, sale_amount.
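To make this concrete, here is a minimal sketch in plain Python comparing a row-oriented layout with a column-oriented layout for the total-sales query. The records, the field names other than sale_amount, and the function names are all hypothetical and made up for illustration. It does not model any particular file format, only the general idea of how the data is grouped.

```python
# Hypothetical sales records; every name and value except sale_amount is invented.
rows = [
    {"order_id": 1, "customer": "alice", "category": "dairy", "sale_amount": 12.5},
    {"order_id": 2, "customer": "bob", "category": "fruit", "sale_amount": 7.25},
    {"order_id": 3, "customer": "alice", "category": "fruit", "sale_amount": 30.0},
    {"order_id": 4, "customer": "bob", "category": "dairy", "sale_amount": 5.25},
]

# Column-oriented layout: the values of each column are stored together.
columns = {name: [record[name] for record in rows] for name in rows[0]}


def total_sales_row_layout(records):
    """Walk every record and pick sale_amount out of each one."""
    return sum(record["sale_amount"] for record in records)


def total_sales_column_layout(cols):
    """Read only the sale_amount column; the other columns are never touched."""
    return sum(cols["sale_amount"])


print(total_sales_row_layout(rows))
print(total_sales_column_layout(columns))
```

Both functions return the same total. The difference is in how much data has to be visited: the row layout walks every record, while the column layout reads one list.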

Use-case 2:

If you are trying to identify consumer behavior:

  • What kind of items are customers placing orders for?
  • Which category of items has each customer ordered from the most?
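The two questions above can be sketched in plain Python as follows. The order records and the function names are hypothetical and made up for illustration. Note that, unlike the total-sales query, these questions likely need several fields of each order (customer, item, and category) rather than a single column.

```python
from collections import Counter, defaultdict

# Hypothetical orders as (customer, item, category); all values are invented.
orders = [
    ("alice", "milk", "dairy"),
    ("alice", "cheese", "dairy"),
    ("alice", "apple", "fruit"),
    ("bob", "apple", "fruit"),
    ("bob", "banana", "fruit"),
    ("bob", "milk", "dairy"),
]


def item_counts(records):
    """Count how many orders were placed for each item."""
    return Counter(item for _, item, _ in records)


def top_category_per_customer(records):
    """For each customer, find the category with the most orders."""
    per_customer = defaultdict(Counter)
    for customer, _, category in records:
        per_customer[customer][category] += 1
    return {
        customer: counts.most_common(1)[0][0]
        for customer, counts in per_customer.items()
    }


print(item_counts(orders))
print(top_category_per_customer(orders))
```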
