Create a PostgreSQL tables from Excel files with automatic data type detection based on Sheetjs cell format
Go to file
2024-11-20 18:35:46 -05:00
test_files refactor(excel): enhance date detection and number parsing 2024-11-20 17:45:26 -05:00
.gitignore new-script: added SheetJSPG.js demo for doc 2024-11-20 18:33:20 -05:00
gen_test_files.py initial 2024-11-20 14:41:09 -05:00
package-lock.json refactor(excel): enhance date detection and number parsing 2024-11-20 17:45:26 -05:00
package.json initial 2024-11-20 14:41:09 -05:00
README.md initial 2024-11-20 14:41:09 -05:00
requirements.txt initial 2024-11-20 14:41:09 -05:00
SheetJSPG.js new-script: added SheetJSPG.js demo for doc 2024-11-20 18:33:20 -05:00
sql-types.js new-script: added SheetJSPG.js demo for doc 2024-11-20 18:33:20 -05:00
test.js new-script: added SheetJSPG.js demo for doc 2024-11-20 18:33:20 -05:00

Sheetjs to PostgreSQL Creating a Table Demo

A Node.js utility that intelligently converts Sheetjs worksheet to PostgreSQL tables while preserving appropriate data types.

This demo project serves as a refernce implementation for SheetJS + PostgreSQL integration. For more details, vist the SheetJS Documentation.

Features

  • Automatic data type detection from Excel columns
  • Support various data formats:
    • Numbers (integer and floating-point)
    • Dates
    • Booleans
    • Text
  • Handles special number formats (scientific notations, high precision)
  • Clean column names for PostgreSQL compatibility

Prerequisites

  • Node.js
  • PostgreSQL (16)
  • Python 3.x

Installation

  1. Install Python dependencies:
pip install -r requirements.txt
  1. Install Node.js dependencies:
npm install i

Setup

  1. Generate test_files:
python3 gen_test_files.py
  1. Configure PostgreSQL connection in test.js
const client = new Client({
    host: 'localhost',
    database: 'SheetJSPG',
    user: 'postgres',
    password: '7509'
});

Run

node test.js

Test Files

The test suite includes various Excel files testing different data scenarios:

  • number_formats.xlsx: Various numeric formats
  • date_formats.xlsx: Date handling
  • special_values.xlsx: Edge cases
  • precision.xlsx: High-precision numbers
  • string_formats.xlsx: Text handling
  • boolean_formats.xlsx: Boolean values

Type Mapping

  • Excel dates → PostgreSQL date
  • Booleans → PostgreSQL boolean
  • High-precision numbers → PostgreSQL numeric
  • Standard numbers → PostgreSQL double precision
  • Text/other → PostgreSQL text