Skip to content

A Python utility for importing CSV files into IBM Netezza database using named pipes and automatic data type detection.

License

Notifications You must be signed in to change notification settings

KrzysztofDusko/NetezzaPythonImport

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

8 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Netezza CSV Import Tool

A Python utility for importing CSV files into IBM Netezza database using named pipes and automatic data type detection.

Features

  • Automatic CSV delimiter detection (supports |, ;, \t, ,)
  • Smart data type inference for columns
  • Handles common date/time formats
  • Supports large files through streaming
  • Progress monitoring during import
  • UTF-8 encoding support

Supported Data Types

  • BIGINT: For integer values
  • NUMERIC: For decimal numbers
  • DATE: For dates in YYYY-MM-DD format
  • DATETIME: For timestamps
  • NVARCHAR: For text fields

Usage

python main.py filename [-l LOG_DIR] [-d DRIVER]

Arguments:
  filename              Path to CSV file to import
  -l, --log_dir        Log directory path (default: C:\log)
  -d, --driver         Driver to use (default: dotnet)

Example

python main.py data.csv -l C:\netezza\logs -d dotnet

The tool will:

  1. Analyze the CSV file to determine column types
  2. Generate CREATE TABLE SQL statement
  3. Create a named pipe
  4. Stream data through the pipe to Netezza

Requirements

  • Python 3.7+
  • pywin32 package
  • Windows OS (due to named pipes implementation)
  • IBM Netezza client tools

About

A Python utility for importing CSV files into IBM Netezza database using named pipes and automatic data type detection.

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages