1.Database concepts

(1)Data & Information

Information

  • Is any kind of event that affects the state of a dynamic system
  • Is the message (utterance or expression) being conveyed
  • Is an ordered sequence of symbols that can be interpreted as a message

As sensory input: Often be viewed as a type of input to an organism or system

Can be recorded and transmitted

Data

Definition

  • values of a qualitative or quantitative variables, belonging to a set of items
  • Used to record information
  • Is the carrier of Information

Data type and Data value

  • Data Type: the way values of the data can be stored in computer system
  • Data Value: records the meaning of information

Data & Information

Data on its own carries no meaning

Data must be interpreted to take on a meaning, so that the information can be revealed

Data is the lower level of abstraction; is the carrier of information

Information is the interpretation of data

(2)Data Processing VS Data Management

Main categories of Data Manipulation

Data Management: Database

Data Processing: Computer program

Data Transmission: Computer Network

Tasks of Data Management

Data Storage: Organize the data and store them into the storage device such as hard disk;

Data Maintenance: Insert new value, delete invalid data or modify old data;

Data Query & Data statistic: Retrieve information from the data storage

Application Requirements

store the data for a long period of time

  • large amounts (100s of GB)
  • protect against crashes
  • protect against unauthorized use

allow users to query/update:

allow several (100s, 1000s) users to access the data simultaneously

allow administrators to change the schema

Trying Without a DBMS

Storing data: file system is limited

  • size less than 4 GB (on 32 bits machines)
  • when system crashes we may loose data
  • password-based authorization insufficient

Query/update:

  • need to write a new C++/Java program for every new query
  • need to worry about performance

Concurrency: limited protection

  • need to worry about interfering with other users
  • need to offer different views to different users (e.g. registrar, students, professors)

Schema change:

  • entails changing file formats
  • need to rewrite virtually all applications

Schema Versus Data

schema: describes how data is to be structured

defined at set-up time

rarely changes

also called “meta data”

data is actual “instance” of database, changes rapidly

vs. types and variables in programming languages

DBMS

Data Definition Language – DDL

  • Easy to define schema

Data Manipulation Language - DML

  • query language

Storage management

  • Retrieve data from disk automatically for you

Transaction Management

  • concurrency control
  • recovery

Automate a lot of boring/mundane operations on data

  • so that we don’t have to program over and over
  • so that we can write complex data manipulations in just a few lines, so that we can concentrate on app logics

Make execution very fast

  • so that it scales up to very large data sets

Make concurrent access/modification possible

  • so that many users can use the data at the same time

Building an Application with a DBMS

Requirements modeling (conceptual, pictures)

  • Decide what entities should be part of the application and how they should be linked.

Schema design and implementation

  • Decide on a set of tables, attributes.
  • Define the tables in the database system.
  • Populate database (insert tuples).

Write application programs using the DBMS

  • way easier now that the data management is taken care of.

Querying a Database(查询数据库)

Database Technology

Started from 1960’s

Is an important branch of CS

  • Programming language
  • OS
  • DB
  • Network

Is the main component of computer infrastructure

Main Functions of DBMS

Data Definition数据定义

  • Provides Data Definition Language to define schema of database

Data Manipulation数据操作

  • Provides Data Manipulation Language to manipulate data in database: RETRIEVE, INSERT, DELETE, MODIFY

Database operation

  • Security
  • Integrity
  • Concurrency
  • recovery

Toolsets

  • Data loader
  • Monitor
  • Performance tuning tools

(3)Database

  • Efficient, convenient, and safe multi-user storage of massive amounts of well organized persistent data

(4)Database Management System

  • A Software System that manages database
  • Buy, install, set up for particular application

(5)Database System

DBS, information systems that based on database

Consists of database, DBMS, application, and users

硬件<OS<DBMS<App development tools<Database system

2.Development of DB tech

Stages

(1)Manual processing

  • Data stored in punched-cards
  • Data managed by hand
  • No data sharing

(2)File system

  • Data stored in files
  • Use OS IO interface to access data
  • Measures taken to accelerate the data access
  • Primary data-program independence

Drawbacks of file systems

Program-Data Dependence

  • All programs maintain metadata for each file they use

Data Redundancy (Duplication of data)

  • Different systems/programs have separate copies of the same data
  • Multiple file formats, duplication of information in different files
    • Requires space, effort and result in loss of data & metadata integrity

Limited Data Sharing

  • No centralized control of data

    • Each application has its own private files & users has little chance to share data outside their own applications

Lengthy Development Times

  • For each new application programmers must design their own file formats & descriptions from scratch

Excessive Program Maintenance

  • 80% of information systems budget

Difficulty in accessing data

  • Need to write a new program to carry out each new task

Integrity problems

  • Integrity constraints
  • Hard to add new constraints or change existing ones

(3)Database

Main advantages compared to file systems:

data sharing

Less data redundancy

Data- program Independence

Convenient program interface

Efficient data access

Data integrity and data security

Concurrency management

Database advantages

Structured data storage

  • Organize the data and link them together by their inner relations
  • Automatically manage the data relationship

Data sharing

  • One copy of data for many applications
  • Allow many users to access data simultaneously

Less data redundancy

  • 共享数据(对应关系变成表格关系)

Data integrity

  • Automatically check the input value of certain data items, according to the data integrity rules

Concurrency并发性

  • Isolate the concurrent accesses
  • Prevent the dirty use of data
    • The modification that may jeopardize the integrity of data

3.Database Architecture

(1)Database Architecture (physical architecture)

The architecture of a database systems is greatly influenced by the underlying computer system on which the database is running:

  • Centralized集中的
  • Client-server客户端-服务器
  • Parallel (multi-processor)并行(多处理器)
  • Distributed分布式

(2)Database Users

Users are differentiated by the way they expect to interact with the system

  • Application programmers – interact with system through DML calls
  • Sophisticated users – form requests in a database query language
  • Specialized users – write specialized database applications that do not fit into the traditional data processing framework
  • Naive users – invoke one of the permanent application programs that have been written previously. Examples, people accessing database over the web, bank tellers, clerical staff

(3)Database Administrator

Coordinates all the activities of the database system; the database administrator has a good understanding of the enterprise’s information resources and needs.

Database administrator’s duties include:

  • Schema definition
  • Storage structure and access method definition
  • Schema and physical organization modification
  • Granting user authority to access the database
  • Specifying integrity constraints
  • Acting as liaison with users
  • Monitoring performance and responding to changes in requirements

Data Management Technology(1) -- Introduction的更多相关文章

  1. Data Management Technology(5) -- Recovery

    Recovery Types of Failures Wrong data entry Prevent by having constraints in the database Fix with d ...

  2. Data Management Technology(3) -- SQL

    SQL is a very-high-level language, in which the programmer is able to avoid specifying a lot of data ...

  3. Data Management Technology(2) -- Data Model

    1.Data Model Model Is the abstraction of real world Reveal the essence of objects, help people to lo ...

  4. Data Management Technology(4) -- 关系数据库理论

    规范化问题的提出 在规范化理论出现以前,层次和网状数据库的设计只是遵循其模型本身固有的原则,而无具体的理论依据可言,因而带有盲目性,可能在以后的运行和使用中发生许多预想不到的问题. 在关系数据库系统中 ...

  5. Building Applications with Force.com and VisualForce(Dev401)(十六):Data Management: Introduction to Upsert

    Dev401-017:Data Management: Introduction to Upsert Module Objectives1.Define upsert.2.Define externa ...

  6. MySQL vs. MongoDB: Choosing a Data Management Solution

    原文地址:http://www.javacodegeeks.com/2015/07/mysql-vs-mongodb.html 1. Introduction It would be fair to ...

  7. [Windows Azure] Data Management and Business Analytics

    http://www.windowsazure.com/en-us/develop/net/fundamentals/cloud-storage/ Managing and analyzing dat ...

  8. Intel Active Management Technology

    http://en.wikipedia.org/wiki/Intel_Active_Management_Technology Intel Active Management Technology F ...

  9. Data Management and Data Management Tools

    Data Management ObjectivesBy the end o this module, you should understand the fundamentals of data m ...

随机推荐

  1. ubuntu下安装tomcat,shutdown时报错:./catalina.sh:1:eval:/home/xxx/jdk/jre/bin/java:not found

    该问题可能导致tomcat启动成功了,但是浏览器输入http://127.0.0.1:8080无法显示tomcat的欢迎界面 打开Tomcat安装目录下的bin文件下的setclasspath.sh, ...

  2. Java 9 ← 2017,2019 → Java 13,来看看Java两年来的变化

    距离 2019 年结束,只剩下 33 天了.你做好准备迎接 2020 年了吗? 一到年底,人就特别容易陷入回忆和比较之中,比如说这几天的对比挑战就火了! 这个话题登上了微博的热搜榜,也刷爆了朋友圈, ...

  3. C# 从字符串中提取指定字符类型的内容

    从一段字符串中,提取中文.英文.数字 中文字符30Margin中文字符40HorizontalAlignment 正则表达式: /// <summary> /// 英文字母与数字 /// ...

  4. telerik reporting 在.net core api 使用

    工具要求:telerik reporting R3 2019..net core 2.2  .vs2017 最新版 从官网下载下来的的telerik reporting 的.net core  例子是 ...

  5. shiro认证授权

    一.shiro基础概念 Authentication:身份认证 / 登录,验证用户是不是拥有相应的身份: Authorization:授权,即权限验证,验证某个已认证的用户是否拥有某个权限:即判断用户 ...

  6. Mysql悲观锁乐观锁区别与使用场景

    本人免费整理了Java高级资料,涵盖了Java.Redis.MongoDB.MySQL.Zookeeper.Spring Cloud.Dubbo高并发分布式等教程,一共30G,需要自己领取.传送门:h ...

  7. Vue 中的keep-alive 什么用处?

    keep-alive keep-alive是Vue提供的一个抽象组件,用来对组件进行缓存,从而节省性能,由于是一个抽象组件,所以在v页面渲染完毕后不会被渲染成一个DOM元素 <keep-aliv ...

  8. android studio 3.4配置Android -jni 开发基础

    首先下载配置android studio ndk 1.打开sdkManager下载CMake和LLDB 2.配置ndk 项目新建 项目建立完毕后,工程目录如下,cpp文件夹是系统自动生成的 3.自定义 ...

  9. Mysql安装及常用命令

    部分内容转载于 https://www.linuxidc.com/Linux/2016-09/135288.htm https://www.cnblogs.com/rays-/p/8081798.ht ...

  10. Navicat Premium连接mongodb基本使用和介绍

    Navicat premium是一款数据库管理工具,是一个可多重连线资料库的管理工具, 它可以让你以单一程式同时连线到 MySQL.SQLite.Oracle 及 PostgreSQL,mongodb ...