What Is A Data Warehouse?
Knowledge

What Is A Data Warehouse?

In recent years, the e-business of enterprises is not limited to single-system issues such as whether the process is smooth or complete storage of transaction records. It often pays more attention to the integration of heterogeneous information systems, how to effectively collect and present data, and has an increasingly specific impact on the operating efficiency of enterprises. The concept of Data Warehouse refers to the concept of warehouse storage.
Published: Jun 07, 2023
What Is A Data Warehouse?

What Is A Data Warehouse?

Data warehouse is usually used for data mining, business intelligence, can cover mountains and seas, and can also deal with a single topic. In recent years, the e-business of enterprises is not limited to single-system issues such as whether the process is smooth or complete storage of transaction records. It often pays more attention to the integration of heterogeneous information systems, how to effectively collect and present data, and has an increasingly specific impact on the operating efficiency of enterprises. The concept of data warehouse refers to the concept of warehouse storage. It not only stores physical raw materials and finished products, but also integrates abstract file data in the information system and converts them into physical data warehouse.

The Difference Between Database, Data Warehouse and Data Warehouse System

A data warehouse is a database that stores large amounts of data, but it is not the same as a database. The data stored in the database is related to operations, and the data warehouse will organize and transfer the data to another data system for data analysis after the data has been accumulated for a period of time. Data warehouse usually refers to a database that stores integrated data, and data warehouse system generally refers to the entire decision-making support system, including system software and hardware, data and reports.

The term "Data Warehouse" was coined by Bill Inmon in 1990, so he is known as the father of Data Warehouse. In the book "What is a Data Warehouse", he believes that the data collection of data warehouse has 4 characteristics. : Subject-oriented, integrated, time-variant, and non-volatile. According to these characteristics, the data warehouse can provide data for decision-making management system for processing. Another representative of data warehouse, Ralph Kimball, believes in the book "The Data Warehouse Toolkit" that data warehouse is a structured copy of transaction data that can be queried and analyzed.

"Subject-oriented" means that the data warehouse can concentrate information related to a specific topic, not just the company's current operating information; "integrated" means that the data stored in the data warehouse is merged from different sources and maintained consistently organized ; "Change according to time" indicates that the data warehouse identifies the stored data at a specific point in time; "no loss" means that the data in the data warehouse will only continue to increase and will not be removed, which enables the management to gain business continuity observations.

Types of Data Warehouse

Data warehouse can be divided into enterprise data warehouse (EDW), operational data store and data mart. Some people think that in addition to enterprise data warehouse and data mart, data warehouse can also add virtual data warehouse and hybrid data warehouse.

  1. Enterprise Data Warehouse
  2. The enterprise data warehouse contains the information of the entire enterprise and consists of several topics, such as customer, product, business, etc., which can be used for decision support, including real-time information and aggregated information.

  3. Operational Data Provider
  4. "Operation" is relative to the informativeness of data warehouses. ODS provides detailed data, especially recent consolidated data, which can meet the needs of real-time reports. Operational data stores can only analyze very recent data and cannot analyze longer-term historical data. Bill Inmon published "The Operational Data Store" in 1995. He believed that the data collection of ODS is subject-oriented and integrated. However, the difference from data storage is that the data of ODS will be lost, and the current value is the main one. It does not contain historical and cumulative data, and ODS data can be collected in real time and integrated. According to the frequency of synchronous update of data, ODS also has grades for data transfer and storage schedule.

  5. Data Marketplace
  6. Roughly the same as the definition of data warehouse, data warehouse covers the data and personnel of the entire company, while data mart only contains a specific range of data, and users will lock the personnel of a certain work group. A group of data marts can form an enterprise data warehouse, and vice versa. Assuming that a company adopts a mode where several data supermarkets exist at the same time, differences in the definition of data of the same dimension will turn the data market into a data island. Data islands are a big problem for the enterprise as a whole. The integration function is limited to departmental groups and cannot be extended to the integration of overall information. Cross-departmental data analysis cannot be performed, and different job attributes cannot be linked. Cross-departmental data analysis, the previous data market structure can only continue to accumulate in a stacked way, and cannot be integrated.

    Nowadays, the construction of data warehouses still mostly starts with data marts, because the dimensional model adopted by data marts is easier to understand than the individual relationship model, and the analysis speed is faster, but it still depends on the needs of enterprises and users.

  7. Virtual Data Warehouse
  8. The enterprise directly uses the existing operating database and assists some intermediary tools for effective data processing. The construction is faster, the chance of success is high, and real-time data analysis can be achieved.

  9. Hybrid Data warehouse
  10. If the data mart is represented as a virtual data warehouse, it becomes a hybrid data warehouse. The storage space required is less than that of enterprise data storage. Since the data is already stored in a standardized data environment, the process of data reorganization will be simpler than reading the running data through the application program, and it will not affect the running data. The hybrid data warehouse can also cope with the data island phenomenon encountered in the data market, and can flexibly respond to different needs through virtual methods.

  11. Benefits of Data Warehouse
  12. Data warehouse can achieve integration across data sources, so that data in different databases can be linked to each other. The establishment of an information system certainly solves the need for regular output and immediate storage of data. Once an enterprise wants to retrieve all kinds of integrated statistical information from the information system, it will immediately face the problem of different data sources, and it is impossible to cross-system at the same time. Access, and further automated processing and analysis is not possible. The data warehouse can be regarded as a single window for extracting data. Through the automatic conversion of the information system, the possibility of errors in manual exchange of files can be reduced.

Summary

The development of data warehouse initially only required the review of aggregated data, and then each transaction data began to be kept in the data warehouse to analyze the relationship between customer groups and products. At present, in addition to storing aggregate data and transaction data, it also retains detailed data to analyze customers' shopping.

This historical process shows that companies used to only want to know the total turnover, but now they are more concerned about how customers make choices in the transaction process.

Data warehouse is often compared with data mining and business intelligence. When used in marketing business, it can be used to understand customer habits, allowing companies to predict customer behavior in order to carry out appropriate promotions; internally, data warehouse can be used in internal operations. The evaluation allows senior executives to find out the crux of the poor operating conditions from specific data and evidence.

Published by Jun 07, 2023 Source :iThome

Further reading

You might also be interested in ...

Headline
Knowledge
Choosing Between C-Frame and H-Frame Hydraulic Presses for Metal Stamping
This article provides a comprehensive guide for manufacturers on choosing between C-frame and H-frame hydraulic presses for metal stamping operations. It begins by analyzing the structural differences: C-frame presses are highlighted for their three-sided accessibility and space-saving design, making them ideal for light to medium-duty tasks. In contrast, H-frame presses are recognized for their superior stability and rigidity, making them the preferred choice for high-tonnage, high-precision, and heavy-duty applications. The article features a detailed comparative table evaluating both types based on tonnage capacity, footprint, and cost. It also outlines critical selection factors such as precision requirements and budget constraints. Finally, the guide naturally introduces leading global manufacturers, including Yeh Chiun, Schuler, AIDA, Komatsu, and Beckwood, helping readers make informed investment decisions tailored to their specific production needs.
Headline
Knowledge
What Do Fruit Juice Suppliers Provide? A Practical Guide for Beverage and Food Brands
A practical overview of ingredient formats, supplier services, and sourcing considerations for beverage and food product development.
Headline
Knowledge
Understanding HVLP Technology: How Low Pressure High Volume Saves Paint and Costs
A practical guide to how HVLP spray systems improve coating efficiency, reduce waste, and support better cost control.
Headline
Knowledge
Why Skin and Immune Formulation Matters More Than Coat Appearance in Companion Animal Health
Skin and coat concerns in companion animals often signal a broader formulation challenge rather than a surface-level issue alone. Recurrent dryness, itching, dull coat condition, and visible sensitivity are frequently linked to barrier weakness, immune imbalance, nutrient utilization, and digestive stability. Products positioned only around coat shine or a single trending ingredient may therefore fall short in daily use. More effective formulation usually begins with a broader biological view: skin health is closely shaped by the interaction between barrier function, immune response, microbiota balance, and life-stage needs.
Headline
Knowledge
Why Food Safety Certifications Matter More Than Ever in Bubble Tea Supply Chains
Bubble tea supply chains are under greater scrutiny than before. Flavor innovation still drives demand, but in cross-border trade, growth increasingly depends on whether ingredients can move through approval processes smoothly, meet market-specific expectations, and remain consistent across repeated shipments. Certifications such as ISO 22000, HACCP, FSSC 22000, HALAL, and KOSHER are no longer just supporting documents. They now influence market access, supplier credibility, risk control, and the ability to maintain stable commercial relationships over time.
Headline
Knowledge
How to Choose a Health Supplement Manufacturer: A B2B Buyer’s Guide to MOQ, Sampling, and Hidden Costs
Choosing a health supplement manufacturer is not just a purchasing decision. For B2B buyers, it is a commercial, technical, and operational decision that directly affects product quality, launch timing, working capital, and long-term supply stability. A manufacturer that looks competitive on paper may still create problems later if its MOQ structure is inflexible, its samples do not reflect production reality, or its quotation leaves out key cost items. That is why buyers evaluating contract manufacturing health supplements partners should look beyond unit price. The better question is not simply “Who can make this product?” but “Which manufacturer can support this project with the right balance of cost transparency, technical fit, and execution reliability?” This guide breaks that decision into five practical steps, with special attention to MOQ, sampling, and hidden costs, three of the most common sources of confusion in supplement sourcing.
Headline
Knowledge
Automatic Loading and Unloading CNC Cylindrical Grinding Machines: How Automation Improves Precision, Throughput, and Process Stability
A neutral overview of how automated work handling is changing cylindrical grinding, from part consistency and labor efficiency to safety and smart manufacturing integration.
Headline
Knowledge
Oil Seal Cross Reference: How to Match Part Numbers, Dimensions, and Seal Types Correctly
A practical guide to using oil seal interchange tables correctly and understanding what still needs to be verified
Headline
Knowledge
Agricultural Aluminum Tripod Ladder: Why It Matters in Orchard Work and Modern Field Safety
A practical introduction to how agricultural aluminum tripod ladders are used, why their design suits orchard work, and what buyers now look for in the category
Headline
Knowledge
Tire Curing Press Machine: How It Shapes Tire Quality, Efficiency, and Modern Production
A practical look at how tire curing press machines work, why they matter in tire manufacturing, and what manufacturers now expect from modern curing systems
Headline
Knowledge
How Fresh Tea Bag Suppliers Maintain Quality from Tea Sourcing to Final Packaging
A closer look at sourcing discipline, production controls, and packaging strategies behind reliable fresh tea bag quality.
Headline
Knowledge
How to Prevent UPS Overload and Improve Electrical Safety in Critical Power Systems
Understanding overload causes, sizing mistakes, and protection planning in UPS backed environments.
Agree