Auto saved by Logseq

This commit is contained in:
Ryan 2023-06-14 16:42:25 +08:00
parent b1f19e9713
commit 0699d6bcb8

View file

@ -957,7 +957,7 @@
- DONE Review relational algebra https://www.geeksforgeeks.org/introduction-of-relational-algebra-in-dbms/ - DONE Review relational algebra https://www.geeksforgeeks.org/introduction-of-relational-algebra-in-dbms/
- LATER review lab2 - LATER review lab2
- LATER SQL join - LATER SQL join
- LATER Block 2 - DONE Block 2
- DONE EER - DONE EER
- Most useful additional concept of EER model: - Most useful additional concept of EER model:
specialization/generalization. specialization/generalization.
@ -1005,7 +1005,10 @@
- The instances of a potential subclass participate in a relationship unique to that subclass. - The instances of a potential subclass participate in a relationship unique to that subclass.
- DONE Block 3 - DONE Block 3
- DONE DB transaction management - DONE DB transaction management
- LATER anomalies - NOW anomalies
:LOGBOOK:
CLOCK: [2023-06-14 Wed 16:41:52]
:END:
- DONE Deadlock and how it can be resolved. #flashcard - DONE Deadlock and how it can be resolved. #flashcard
- A deadlock is a situation in which two or more transactions are unable - A deadlock is a situation in which two or more transactions are unable
to proceed because each is waiting for a resource held by the other, to proceed because each is waiting for a resource held by the other,
@ -1073,31 +1076,24 @@
- Boyce-Codd Normal Form (BCNF): BCNF is a stricter form of 3NF that ensures that each determinant in a table is a candidate key. In other words, BCNF ensures that _each non-key attribute is dependent only on the candidate key._ - Boyce-Codd Normal Form (BCNF): BCNF is a stricter form of 3NF that ensures that each determinant in a table is a candidate key. In other words, BCNF ensures that _each non-key attribute is dependent only on the candidate key._
- Fourth Normal Form (4NF): 4NF is a further refinement of BCNF that ensures that _a table does not contain any multi-valued dependencies._ - Fourth Normal Form (4NF): 4NF is a further refinement of BCNF that ensures that _a table does not contain any multi-valued dependencies._
- Fifth Normal Form (5NF): 5NF is the highest level of normalization and involves decomposing a table into smaller tables to _remove data redundancy and improve data integrity._ - Fifth Normal Form (5NF): 5NF is the highest level of normalization and involves decomposing a table into smaller tables to _remove data redundancy and improve data integrity._
- DONE Block 4 - LATER Block 4
- DONE Distributed DBMS - DONE Distributed DBMS
collapsed:: true
- DONE client server arch - DONE client server arch
collapsed:: true
- Computers (client) connected over wired or wireless local area network (LAN) - Computers (client) connected over wired or wireless local area network (LAN)
- The database itself and the DBMS are stored on a central device called the database server, which is also connected to the network. - The database itself and the DBMS are stored on a central device called the database server, which is also connected to the network.
- Distributed Database - Distributed Database
collapsed:: true
- A logically interrelated collection of shared data (and a description of this data), physically spread over a computer network. - A logically interrelated collection of shared data (and a description of this data), physically spread over a computer network.
- Distributed DBMS - Distributed DBMS
collapsed:: true
- Software system that permits the management of the distributed database and makes the distribution transparent to users. - Software system that permits the management of the distributed database and makes the distribution transparent to users.
- the key issues - the key issues
collapsed:: true
- Fragmentation - Fragmentation
- Allocation - Allocation
- Replication - Replication
- importance and different types of fragmentation - importance and different types of fragmentation
collapsed:: true
- Horizontal - Horizontal
- Vertical - Vertical
- Mixed - Mixed
- different types of transparency - different types of transparency
collapsed:: true
- Distribution Transparency: The database feels as a single, logical entity - Distribution Transparency: The database feels as a single, logical entity
- Transaction Transparency: Ensures that all distributed transactions maintain distributed databases integrity and consistency. - Transaction Transparency: Ensures that all distributed transactions maintain distributed databases integrity and consistency.
- Performance Transparency: must perform as if it were a centralized DBMS. - Performance Transparency: must perform as if it were a centralized DBMS.
@ -1125,54 +1121,41 @@
- XSD: a more comprehensive method of defining content model of - XSD: a more comprehensive method of defining content model of
an XML document. an XML document.
- DONE Data Mining - DONE Data Mining
collapsed:: true
- concept - concept
collapsed:: true
- The process of extracting valid, previously unknown, comprehensible, - The process of extracting valid, previously unknown, comprehensible,
and actionable information from large databases and using it to make and actionable information from large databases and using it to make
crucial business decisions. crucial business decisions.
- different applications - different applications
collapsed:: true
- Retail / Marketing - Retail / Marketing
- Banking - Banking
- Insurance - Insurance
- Medicine - Medicine
- basic techniques - basic techniques
collapsed:: true
- predictive modelling, - predictive modelling,
collapsed:: true
- uses observations to form a model of the important characteristics of some - uses observations to form a model of the important characteristics of some
phenomenon phenomenon
- database segmentation, - database segmentation,
collapsed:: true
- Uses unsupervised learning to discover homogeneous subpopulations in a database to improve the accuracy of the profiles. - Uses unsupervised learning to discover homogeneous subpopulations in a database to improve the accuracy of the profiles.
- link analysis, - link analysis,
collapsed:: true
- Establishing links, called associations, between the individual - Establishing links, called associations, between the individual
records, or sets of records, in a database. records, or sets of records, in a database.
- deviation detection. - deviation detection.
collapsed:: true
- Identifies outliers, which express deviation from some previously - Identifies outliers, which express deviation from some previously
known expectation and norm. known expectation and norm.
- DONE NoSQL - DONE NoSQL
collapsed:: true
- the motivation for NoSQL - the motivation for NoSQL
collapsed:: true
- By giving up ACID constraints, one can achieve - By giving up ACID constraints, one can achieve
much higher performance and scalability. much higher performance and scalability.
- explain the concepts of NoSQL - explain the concepts of NoSQL
collapsed:: true
- NoSQL databases (aka "not only SQL") are non-tabular databases and store - NoSQL databases (aka "not only SQL") are non-tabular databases and store
data differently than relational tables. NoSQL databases come in a data differently than relational tables. NoSQL databases come in a
variety of types based on their data model. The main types are document, variety of types based on their data model. The main types are document,
key-value, wide-column, and graph. They provide flexible schemas and key-value, wide-column, and graph. They provide flexible schemas and
scale easily with large amounts of data and high user loads. scale easily with large amounts of data and high user loads.
- explain the application areas of NoSQL - explain the application areas of NoSQL
collapsed:: true
- NoSQL is an alternative, non-traditional DB technology to be - NoSQL is an alternative, non-traditional DB technology to be
used in large scale environments where (ACID) transactions are not a priority. used in large scale environments where (ACID) transactions are not a priority.
- CAP theorem: - CAP theorem:
collapsed:: true
- There are 3 main properties for distributed management: - There are 3 main properties for distributed management:
1. Consistency → A data item has the same value at the same time (to 1. Consistency → A data item has the same value at the same time (to
ensure coherency). ensure coherency).