Data Warehousing And Data Mining

Course Code: MCA5143

Course Title: Course Title: Advanced Web Programming (4 Credits)



Course Contents


UNIT 1: Data Warehouse Fundamentals: Introduction   to Data Warehouse, OLTP Systems; Differences between   OLTP Systems and Data Warehouse: Characteristics of Data Warehouse; Functionality of Data Warehouse: Advantages and Applications of Data Warehouse; Advantages, Applications: Top- Down   and Bottom-Up Development Methodology: Tools for Data warehouse development: Data Warehouse Types:


Unit 2: Planning and Requirements: Introduction: Planning Data Warehouse and Key Issues: Planning and Project Management in constructing Datawarehouse: Data Warehouse Project; Data Warehouse development Life Cycle, Kimball Lifecycle Diagram, Requirements Gathering Approaches: Team organization, Roles, and Responsibilities:


Unit 3: Data Warehouse Architecture: Introductions, Components of Data warehouse Architecture: Technical Architectures; Data warehouse architectures 1: Data warehouse architecture 2: Data warehouse architecture 3: Tool selection:  Federated Data Warehouse Architecture:


Unit 4: Dimensional Modeling: Introduction: E-R Modeling: Dimensional Modeling: E-R Modeling VS Dimensional Modeling: Data Warehouse Schemas; Star Schema, Inside Dimensional Table, Inside Fact Table, Fact Less Fact Table, Granularity, Star Schema Keys: Snowflake Schema: Fact Constellation Schema:


Unit 5: Extract, Transform and Load: Introduction: ETL Overview or Introduction to ETL: ETL requirements and steps: Data Extraction; Extraction Methods, Logical   Extraction   Methods, Physical Extraction Methods: Data Transformation; Basic Tasks in Transformation, Major Data Transformation Types: Data loading; Data Loading Techniques: ETL Tools:


Unit 6: Data Warehouse & OLAP: Introduction:  What is OLAP?; Characteristics of OLAP, Steps in the OLAP Creation Process, Advantageous of OLAP: What is Multidimensional Data: OLAP Architectures; MOLAP, ROLAP, HOLAP: Data Warehouse and OLAP: Hypercube & Multicubes:


Unit 7: Meta data Management in Data Warehouse: Introductions to Metadata: Categorizing Meta data: Meta data management in practice; Meta data requirements gathering, Meta data classification, Meta data collection strategies: Meta Data Management in Oracle and SAS: Tools for Meta data management:


Unit 8: Introduction to Data Mining: Introduction: Scope of Data Mining: What is Data Mining; How does Data Mining Works, Predictive Modeling: Data Mining and Data Warehousing: Architecture for Data Mining: Profitable Applications: Data Mining Tools:


Unit 9: Business Intelligence: IntroductionBusiness Intelligence, Business Intelligence tools, Business Intelligence Infrastructure, Business Intelligence Applications, BI versus Data Warehouse, BI versus Data Mining, Future of BI.


Unit 10: Data Preprocessing: Introduction, Data Preprocessing Overview, Data Cleaning, Data Integration and Transformation, Data Reduction, Discretization and Concept Hierarchy Generation.


Unit 11: Data Mining Techniques- An Overview: Introduction, Data Mining, Data Mining Versus Database Management System, Data Mining Techniques- Association rules, Classification, Regression, Clustering, Neural networks.


Unit 12: Clustering: Introduction, Clustering, Cluster Analysis, Clustering Methods- K means, Hierarchical clustering, Agglomerative clustering, Divisive clustering, clustering and segmentation software, evaluating clusters.


Unit 13: Web Mining: Introduction, Terminologies, Categories of Web Mining – Web Content Mining, Web Structure Mining, Web Usage Mining, Applications of Web Mining, and Agent based and Data base approaches, Web mining Software.


Unit 14: Applications of Data mining: Introduction, Business Applications Using Data Mining- Risk management and targeted marketing, Customer profiles and feature construction, Medical applications (diabetic screening), Scientific Applications using Data Mining, Other Applications.