GreenplumPython
Contents:
Requirements
Requirements on Server for Advanced Features
Tutorials
Comparison with SQL
Comparison with pandas
Predicting the age of abalone with Linear Regression
Generating, Indexing and Searching Embeddings (Experimental)
Installing Python Packages on Server without Internet (Experimental)
Module References
GreenplumPython
»
Tutorials
Edit on GitHub
Tutorials
Comparison with SQL
Prerequisites
Getting Access to Database
Accessing a DataFrame in the Database
Basic Data Manipulation
Joining Two DataFrames
Creating and Calling Functions
Data Grouping
Comparison with pandas
Data Structure
Data Selection
Data Transformation
Predicting the age of abalone with Linear Regression
Background
Problem
Fetching data from the Internet
Train-Test Set Split
Import preparation
Data Exploration
Learning to Make Predictions
Generating, Indexing and Searching Embeddings (Experimental)
Installing the Package
Preparing Data
Generating and Indexing Embeddings
Generating Embeddings without Indexing
Semantic Search by Embeddings
Cleaning All at Once
Installing Python Packages on Server without Internet (Experimental)
(Optional) Prerequisite: Sharing Python Environments in a Cluster with NFS
Example: A UDF requiring a Third-Party Package
Installing Python Packages