Getting Started with Amazon Neptune
Overview
Graph databases, such as Amazon Neptune Database and Amazon Neptune Analytics, are purpose built to store and navigate relationships. They have advantages over relational databases for use cases such as social networking, recommendation engines, and fraud detection where you need to create complex relationships between data and quickly query these relationships. Amazon Neptune uses graph structures such as nodes (data entities), edges (relationships), and properties to represent and store data. The relationships are stored as first-order citizens of the data model. This allows data in nodes to be directly linked, dramatically improving the performance of queries that navigate relationships in the data.
Getting started with Amazon Neptune Database
If you already have your data in a graph model, it’s easy to get started with Amazon Neptune Database. You can load data in CSV or RDF formats and begin writing graph queries with Apache TinkerPop Gremlin, SPARQL, or openCypher. You can use the getting started documentation or view the AWS Online Tech Talk through the following links. We've also consolidated best practices for Neptune Database as well.
Getting started with Amazon Neptune Analytics
You can get started with Neptune Analytics in a few steps by creating a graph using the AWS Management Console or the CDK, SDK, or CLI. AWS CloudFormation support coming soon. You can load a graph into Neptune Analytics from data in an Amazon S3 bucket or from a Neptune database. You can send requests using the openCypher query language to a graph in Neptune Analytics directly from your graph applications. You can also connect to the graph in Neptune Analytics from a Jupyter notebook to run queries and graph algorithms. Results of analytic queries can be written back into the Neptune Analytics graph to serve incoming queries or stored within S3 for further processing. Neptune Analytics supports integration with the open-source LangChain library to work with existing applications powered by large language models.
Getting started with Amazon Neptune ML
- Setting up the test environment
- Launching the node classification notebook sample
- Loading the sample data into the cluster
- Exporting the graph
- Performing ML training
- Running Gremlin queries with Neptune ML
Getting started with graph visualization
If you are familiar with graph query languages or running graph workloads in a notebook environment, you can start with Neptune notebooks. Neptune provides Jupyter and JupyterLab notebooks in the open-source Neptune graph notebook project on GitHub and in the Neptune workbench. These notebooks offer sample application tutorials and code snippets in an interactive coding environment where you can learn about graph technology and Neptune.
Neptune notebooks can both visualize query results and provide an IDE-like interface for application development and testing, or you can use Neptune notebooks with other Neptune features such as Neptune Streams and Neptune ML. Additionally, each Neptune notebook hosts a Graph Explorer endpoint. You can find a link to open Graph Explorer on each notebook instance in the Amazon Neptune console.