Selecting best NoSQL
Selecting No-SQL DBMS
Finding the best NoSQL DBMS
@Mohammed Fazuluddin
Topics
 Why choose a NoSQL database
 Overview
 Brief on the different types of NoSQL databases
Why choose a NoSQL database
 To improve programmer productivity by using a database that better matches
an application's needs.
 To improve data access performance via some combination of handling larger
data volumes, reducing latency, and improving throughput.
 Since most of the NoSQL databases are open source, testing them is a simple
matter of downloading these products and setting up a test environment.
 Separating parts of applications into services also allows you to introduce
NoSQL into an existing application.
Overview
 NoSQL means that when designing a software solution there is more than one
storage mechanism that could be used, depending on the application's needs.
 Due to increasing needs for scalability and performance, alternative systems
have emerged, namely NoSQL technology.
 There are hundreds of readily available NoSQL databases, and each has
different use case scenarios.
 NoSQL databases can be divided into four main categories:
 Document Database
 Key-value Database
 Column Based Database
 Graph Database
Overview
 Before going down the NoSQL path, it's a good idea to recheck whether your
existing DBMS software can meet the current requirement.
 Using NoSQL databases allows developers to develop without having to
convert in-memory structures to relational structures.
 NoSQL does not have a prescriptive definition but we can make a set of
common observations, such as:
 Not using the relational model
 Running well on clusters
 Mostly open-source
 Built for the 21st century web estates
 Schema-less
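The point about not converting in-memory structures to relational structures can be illustrated with a minimal Python sketch. The `order` structure, its field names and the two row layouts are hypothetical, chosen only for illustration; the sketch shows the same object stored as one nested document versus split into flat rows for two tables.

```python
import json

# Hypothetical in-memory structure an application might hold.
order = {
    "order_id": 1001,
    "customer": {"name": "Ada", "city": "London"},
    "items": [
        {"sku": "A-1", "qty": 2},
        {"sku": "B-7", "qty": 1},
    ],
}

def to_document(obj):
    """Document-style storage: serialize the nested structure as it is."""
    return json.dumps(obj, sort_keys=True)

def to_relational_rows(obj):
    """Relational-style storage: flatten the structure into rows per table."""
    orders = [(obj["order_id"], obj["customer"]["name"], obj["customer"]["city"])]
    order_items = [(obj["order_id"], item["sku"], item["qty"]) for item in obj["items"]]
    return orders, order_items

document = to_document(order)
orders_rows, item_rows = to_relational_rows(order)
```

Reading the document back with `json.loads` restores the original structure in one step, whereas the relational form has to be reassembled from both row sets.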
Document Database
 The document store DBMS stores data at the document level using a
semi-structured format such as JavaScript Object Notation (JSON) or XML.
 The document data model makes it easy for developers to store and combine
data of any structure, without giving up data access and indexing
functionality.
 Database administrators (DBAs) can dynamically modify the schema without
downtime.
 Document databases work well for event logging, online shopping, content
management and in-depth analytical processing.
 The schema flexibility of document databases can also be useful for projects
that require rapid prototyping.
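As a rough illustration of schema flexibility combined with indexing, the following Python sketch keeps documents of different shapes in one store and still answers field lookups through an index. `TinyDocumentStore` and its methods are hypothetical names for a toy in-memory model, not the API of any real document database.

```python
from collections import defaultdict

class TinyDocumentStore:
    """Toy in-memory document store (illustrative only)."""

    def __init__(self):
        self._docs = {}      # doc_id -> document (a dict of any shape)
        self._indexes = {}   # field -> {value -> set of doc_ids}

    def create_index(self, field):
        # Build the index from documents that already exist.
        index = defaultdict(set)
        for doc_id, doc in self._docs.items():
            if field in doc:
                index[doc[field]].add(doc_id)
        self._indexes[field] = index

    def insert(self, doc_id, doc):
        if doc_id in self._docs:
            raise KeyError(f"document {doc_id!r} already exists")
        self._docs[doc_id] = doc
        # Keep every existing index up to date; no schema change is needed.
        for field, index in self._indexes.items():
            if field in doc:
                index[doc[field]].add(doc_id)

    def find(self, field, value):
        if field in self._indexes:
            ids = self._indexes[field].get(value, set())
            return [self._docs[i] for i in sorted(ids)]
        # Fall back to a full scan when the field is not indexed.
        return [d for _, d in sorted(self._docs.items()) if d.get(field) == value]

store = TinyDocumentStore()
store.insert(1, {"type": "event", "level": "error", "msg": "disk full"})
store.insert(2, {"type": "product", "name": "lamp", "price": 19.5})
store.create_index("type")
store.insert(3, {"type": "event", "level": "info"})
events = store.find("type", "event")
```

The event and product documents carry different fields, yet both live in the same store and the `type` index covers them without any migration step.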
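Document Database
 One of the leading NoSQL DBMSs is MongoDB, a document store DBMS whose
source code is publicly available (under the SSPL since 2018; earlier releases
were open source under the AGPL).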
 It's designed to make it easy to develop and run modern applications that rely
on structured and unstructured data while delivering scalability and high
availability, and supporting rapidly changing data.
 There are probably more technicians familiar with it than any other NoSQL
DBMS, making it somewhat easier to staff MongoDB projects.
 MongoDB stores data as documents in a binary representation called BSON
(Binary JSON).
 MongoDB is specifically designed for rapidly building applications that scale
globally and are inexpensive to operate.
Document Database
 Another option is Couchbase Server, a JSON-based document store derived
from CouchDB, which is an Apache open source project.
 Couchbase Server delivers eventual consistency for transactions, as opposed
to ACID (atomicity, consistency, isolation, and durability).
 Many NoSQL offerings rely on command line interface (CLI) administration,
but Couchbase Server administration tasks can be performed using the Web,
CLI or RESTful API.
 Another option is MarkLogic Server, which can handle JSON, XML and Resource
Description Framework (RDF) data natively and offers critical enterprise
features such as ACID transactions, automated failover and security.
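The difference between eventual consistency and immediately consistent reads can be sketched with a toy model in Python. `EventuallyConsistentStore`, its replica count and the explicit `sync()` step are assumptions made for illustration; this is not how any particular product replicates data. A write lands on one replica first, and other replicas return stale data until propagation catches up.

```python
class EventuallyConsistentStore:
    """Toy model: a write hits replica 0 and reaches the others later."""

    def __init__(self, replica_count=3):
        self.replicas = [{} for _ in range(replica_count)]
        self.pending = []  # (replica_index, key, value) not yet applied

    def write(self, key, value):
        self.replicas[0][key] = value
        for index in range(1, len(self.replicas)):
            self.pending.append((index, key, value))

    def read(self, key, replica_index):
        return self.replicas[replica_index].get(key)

    def sync(self):
        # Apply queued updates in order, simulating background replication.
        for index, key, value in self.pending:
            self.replicas[index][key] = value
        self.pending.clear()

store = EventuallyConsistentStore()
store.write("cart:7", "book")
stale_read = store.read("cart:7", replica_index=2)   # replica has not caught up
store.sync()
fresh_read = store.read("cart:7", replica_index=2)   # replicas have converged
```

An ACID system would not let the second reader observe the missing value after the write committed; an eventually consistent one accepts that window in exchange for availability and latency.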
Key-Value Database
 The key-value approach is somewhat similar to the document approach. Both
offer flexible schemata, but the data in a key-value store isn't structured
using a data format such as JSON; the database typically treats each value as
an opaque blob.
 Key-value databases excel at session management, serving ad content and
managing user or product profiles. When data is encoded in many different
ways without a rigorous schema, using a key-value database can make sense.
 One of the leading key-value DBMSs is Redis, a key-value data store that was
originally released as open source under the BSD license.
 Redis is a key-value store, but it also supports different kinds of data
structures. Whereas with traditional key-value stores you associate string keys
with string values, in Redis the value isn't limited to a simple string but can also
hold more complex data structures.
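The idea of a key-value store whose values are richer than plain strings can be sketched in Python as follows. `TinyKeyValueStore` and its method names are hypothetical and are not Redis commands or any client library's API; the sketch only shows string, list, set and hash values living under ordinary string keys.

```python
class TinyKeyValueStore:
    """Toy key-value store whose values may be structured (illustrative only)."""

    def __init__(self):
        self._data = {}

    def set(self, key, value):
        # Classic key-value usage: one string key, one string value.
        self._data[key] = value

    def get(self, key):
        return self._data.get(key)

    def list_push(self, key, item):
        # The value is a list; items keep their insertion order.
        self._data.setdefault(key, []).append(item)

    def set_add(self, key, member):
        # The value is a set; duplicates are ignored.
        self._data.setdefault(key, set()).add(member)

    def hash_set(self, key, field, value):
        # The value is a small field -> value mapping.
        self._data.setdefault(key, {})[field] = value

kv = TinyKeyValueStore()
kv.set("session:42", "user=ada")
kv.list_push("queue:emails", "welcome")
kv.list_push("queue:emails", "receipt")
kv.set_add("tags:post:9", "nosql")
kv.set_add("tags:post:9", "nosql")
kv.hash_set("profile:ada", "city", "London")
```

Session data, work queues, tag sets and profiles all share one flat key space here, which matches the session-management and profile use cases named on the previous slide.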
Key-Value Database
 Another NoSQL key-value DBMS option is Riak from Basho Technologies.
 Riak is a fault-tolerant, highly available, scalable, distributed multi-model
DBMS.
 Riak open source is free under the Apache 2 license whereas Riak Enterprise
requires a commercial license agreement, sold by Basho Technologies.
 Riak is more accurately termed a multi-model platform, supporting key-value,
object store and search capabilities all from the same platform.
 Riak is an open source, distributed DBMS that's implemented across multiple
servers. Any server can respond to read or write requests, and if one server
fails, the other servers continue to act upon client requests.
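The behaviour described above, where any server can answer and the cluster keeps serving requests after a failure, can be sketched with a toy Python model. `TinyCluster` and `Node` are hypothetical names, and copying every write to every live node is a simplifying assumption; real distributed stores typically replicate each key to a subset of nodes.

```python
class Node:
    def __init__(self, name):
        self.name = name
        self.alive = True
        self.data = {}

class TinyCluster:
    """Toy cluster with no master node: any live node can serve a request."""

    def __init__(self, names):
        self.nodes = [Node(name) for name in names]

    def _live_nodes(self):
        live = [node for node in self.nodes if node.alive]
        if not live:
            raise RuntimeError("no live nodes left in the cluster")
        return live

    def put(self, key, value):
        # Simplification: copy the write to every node that is currently up.
        for node in self._live_nodes():
            node.data[key] = value

    def get(self, key):
        # Any live node may answer; this sketch simply takes the first one.
        return self._live_nodes()[0].data.get(key)

cluster = TinyCluster(["node-a", "node-b", "node-c"])
cluster.put("user:1", "Ada")
cluster.nodes[0].alive = False          # simulate a server failure
value_after_failure = cluster.get("user:1")
cluster.put("user:2", "Grace")          # writes still succeed
```

A node that comes back after missing writes would need to be repaired before serving reads; the sketch leaves that step out.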
Column Database
 A column store NoSQL DBMS allows you to store data with keys mapped to
values and the values grouped into families that are often accessed together.
 A column database is well-suited for data where writes are uncommon and
applications need to access a few columns of many rows all at once.
 Column stores work well for event logging, content management and
counting/categorizing for analytics.
 Column stores are also useful when you have expiring data because you can
set up a column to automatically expire.
 Apache Cassandra is one of the top NoSQL column family DBMSs. It is an open
source DBMS that was originally developed at Facebook and later released as
an open source project, so it is freely available to download and use.
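The column-family layout and expiring columns described on this slide can be sketched in Python as follows. `TinyColumnFamilyStore` is a hypothetical toy model, not Cassandra's or HBase's API, and the clock is passed in explicitly as `now` (in seconds) so the example stays deterministic.

```python
class TinyColumnFamilyStore:
    """Toy column-family store with optional per-column expiry."""

    def __init__(self):
        # row_key -> family -> column -> (value, expires_at or None)
        self._rows = {}

    def put(self, row_key, family, column, value, now, ttl=None):
        expires_at = None if ttl is None else now + ttl
        families = self._rows.setdefault(row_key, {})
        families.setdefault(family, {})[column] = (value, expires_at)

    def get_family(self, row_key, family, now):
        # Columns of one family are stored together and fetched together.
        cells = self._rows.get(row_key, {}).get(family, {})
        return {
            column: value
            for column, (value, expires_at) in cells.items()
            if expires_at is None or now < expires_at
        }

store = TinyColumnFamilyStore()
store.put("user:1", "profile", "name", "Ada", now=0)
store.put("user:1", "profile", "city", "London", now=0)
store.put("user:1", "session", "token", "abc123", now=0, ttl=60)

profile = store.get_family("user:1", "profile", now=30)
session_live = store.get_family("user:1", "session", now=30)
session_expired = store.get_family("user:1", "session", now=61)
```

Grouping `name` and `city` in one family means a profile read touches only that family, while the `token` column disappears on its own once its time to live has passed.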
Column Database
 Apache Cassandra is designed to be used by online applications that require
fast performance with no downtime. It was engineered to handle very large
amounts of data spread out across commodity servers to deliver high
availability without a single point of failure.
 DataStax, a commercial vendor, has created a supported, enterprise-level
version of Cassandra called DataStax Enterprise.
 DataStax Enterprise is free to use in development environments; use in
production requires the purchase of a license (or enrollment in the startup
program).
 DataStax offers subscriptions for both production and non-production
environments that include certified software and support.
Column Database
 Apache HBase is another leading open source NoSQL column store.
 Designed to deliver random, real-time, read/write access to large amounts of
data using commodity hardware, HBase is modeled after Google's Bigtable
storage system.
 It's built on top of Hadoop and Hadoop Distributed File System (HDFS).
 Although Hadoop and HBase are open source projects there are commercial
providers such as Cloudera, which offers Cloudera Enterprise. It combines
Apache Hadoop and other open source projects into a single, highly scalable
system for analytical processing. Of course, Cloudera isn't the only
commercial provider; for example, Hortonworks and MapR Technologies are
other leading providers of Hadoop distributions that include HBase.
Graph Database
 The graph database NoSQL category focuses on relationships between values
and stores data using graph structures with nodes, edges and properties.
 In a graph database every element contains a direct pointer to its adjacent
elements, so traversing a relationship requires no index lookup.
 It is used in social media (relationship management), search, network and IT
operations, fraud detection, real-time recommendations, digital asset
management and master data management: essentially any application that
benefits from harnessing the power of data relationships using graphs.
 The leading graph database is Neo4j. Neo4j is a native graph
database system in which entities are stored as nodes and the relationships
between them form the structure of the database.
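The notion of nodes holding direct references to their neighbours can be sketched in Python. `Node`, `relate` and `reachable` are hypothetical names for a toy in-memory model and do not reflect Neo4j's storage format or query language; the sketch follows object references from node to node instead of looking neighbours up in a separate index.

```python
from collections import deque

class Node:
    """A graph node with properties and direct references to its neighbours."""

    def __init__(self, name, **properties):
        self.name = name
        self.properties = properties
        self.edges = []  # list of (relationship_type, target Node)

    def relate(self, rel_type, other):
        self.edges.append((rel_type, other))

def reachable(start, rel_type, max_depth):
    """Breadth-first traversal along one relationship type, up to max_depth hops."""
    seen = {start.name}
    queue = deque([(start, 0)])
    found = []
    while queue:
        node, depth = queue.popleft()
        if depth == max_depth:
            continue
        for kind, target in node.edges:
            if kind == rel_type and target.name not in seen:
                seen.add(target.name)
                found.append(target.name)
                queue.append((target, depth + 1))
    return found

alice = Node("alice", city="London")
bob = Node("bob")
carol = Node("carol")
dave = Node("dave")
post = Node("post-1", kind="article")

alice.relate("FOLLOWS", bob)
alice.relate("FOLLOWS", dave)
bob.relate("FOLLOWS", carol)
bob.relate("LIKES", post)

direct = reachable(alice, "FOLLOWS", max_depth=1)
two_hops = reachable(alice, "FOLLOWS", max_depth=2)
```

Each hop costs only a walk over the current node's own edge list, which is why multi-hop questions such as "who do the people I follow also follow" suit this model.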
Graph Database
 Graph databases allow you to store entities and relationships between these
entities. Entities are also known as nodes, which have properties.
 Nodes can have different types of relationships between them, allowing you
to both represent relationships between the domain entities and to have
secondary relationships for things like category, path, time-trees, quad-trees
for spatial indexing, or linked lists for sorted access.
 Since most of the power from the graph databases comes from the
relationships and their properties, a lot of thought and design work is needed
to model the relationships in the domain that we are trying to work with.
 Relationships are first-class citizens in graph databases; most of the value of
graph databases is derived from the relationships.
Graph Database
 There are many graph databases available, such as Neo4j, InfiniteGraph,
OrientDB, or FlockDB (which is a special case: a graph database that
only supports single-depth relationships or adjacency lists, where you cannot
traverse more than one level deep for relationships).
 Neo4j offers ACID transactions, high-availability clustering for enterprise
deployments, and comes with a Web-based administration tool.
 Neo4j isn't new technology; the company has been in business for more than a
decade.
 Another option is Titan, which is optimized for storing and querying graphs
distributed across a cluster of machines.
Graph Database
 Titan has a pluggable storage architecture that allows it to build on proven
database technology such as Apache Cassandra, Apache HBase or Oracle
Berkeley DB.
 Choosing a multi-model approach can make sense for applications needing
several different NoSQL approaches (such as key/value for some data and
graph for others).
 Most NoSQL DBMS offerings are open source and can be licensed for free
under an open source license or via a commercial license from a vendor that
offers support and upgrades.
 The commercial option is recommended for organizations intending to use
NoSQL databases in production applications and systems.
The multi-model DBMS
 Another choice in the NoSQL market is the multi-model DBMS. A growing
number of vendors have delivered DBMS products that support more than one
(or all) of the NoSQL models (and in some cases, relational, too). Examples of
multi-model NoSQL offerings include DataStax Enterprise, FoundationDB,
CortexDB and OrientDB.
 Your existing relational DBMS may also be an option. The relational vendors
are working to expand their DBMSs to embrace NoSQL, and some have
already started to introduce NoSQL capabilities.
 One example is IBM DB2. DB2 for Linux, UNIX and Windows offers a column
store capability (albeit a relational column store) and can store RDF graph
triples and JSON documents, which may obviate the need for DB2 users to
acquire a graph or document database.
Thank You

