docs.sheetjs.com/docz/docs/03-demos/32-extensions/11-matlab.md

316 lines
9.0 KiB
Markdown
Raw Normal View History

2023-09-14 08:19:13 +00:00
---
title: Modern Spreadsheets in MATLAB
sidebar_label: MATLAB
description: Build complex data pipelines in MATLAB M-Files. Seamlessly create MATLAB tables with SheetJS. Leverage the MATLAB toolbox ecosystem to analyze data from Excel workbooks.
pagination_prev: demos/cloud/index
pagination_next: demos/bigdata/index
2024-05-04 16:15:00 +00:00
sidebar_custom_props:
summary: Convert between MATLAB tables and spreadsheets
2023-09-14 08:19:13 +00:00
---
import current from '/version.js';
2024-12-22 04:47:57 +00:00
import Tabs from '@theme/Tabs';
import TabItem from '@theme/TabItem';
2023-09-14 08:19:13 +00:00
import CodeBlock from '@theme/CodeBlock';
[MATLAB](https://www.mathworks.com/products/matlab.html) is a numeric computing
platform. It has a native `table` type with limited support for spreadsheets.
[SheetJS](https://sheetjs.com) is a JavaScript library for reading and writing
data from spreadsheets.
This demo uses SheetJS to pull data from a spreadsheet for further analysis
within MATLAB. We'll explore how to run an external tool to convert complex
spreadsheets into simple XLSX files for MATLAB.
2023-11-28 07:05:59 +00:00
:::note Tested Deployments
2023-09-14 08:19:13 +00:00
2024-06-09 21:48:23 +00:00
This demo was tested by SheetJS users in the following deployments:
| Architecture | Version | Date |
|:-------------|:--------|:-----------|
| `darwin-x64` | R2024a | 2024-06-09 |
2024-12-22 04:47:57 +00:00
| `win11-x64` | R2024b | 2024-12-21 |
2023-09-14 08:19:13 +00:00
:::
:::info pass
MATLAB has limited support for processing spreadsheets through `readtable`[^1]
and `writetable`[^2]. At the time of writing, it lacked support for XLSB,
NUMBERS, and other common spreadsheet formats.
SheetJS libraries help fill the gap by normalizing spreadsheets to a form that
MATLAB can understand.
:::
## Integration Details
:::note pass
MATLAB does not currently provide a way to parse a CSV string or a character
array representing file data. `readtable`, `writetable`, `csvread`, and
`csvwrite` work with the file system directly. `strread` and `textscan` are
designed specifically for reading numbers.
:::
The current recommendation involves a dedicated command-line tool that leverages
SheetJS libraries to to perform spreadsheet processing.
The [SheetJS NodeJS module](/docs/getting-started/installation/nodejs) can be
loaded in NodeJS scripts and bundled in standalone command-line tools.
### Command-Line Tools
2024-03-18 08:24:41 +00:00
The ["Command-Line Tools" demo](/docs/demos/cli) creates `xlsx-cli`, a
2023-09-14 08:19:13 +00:00
command-line tool that reads a spreadsheet file and generates output. The
examples in the "NodeJS" section are able to generate XLSX spreadsheets using
the `--xlsx` command line flag:
```bash
$ xlsx-cli --xlsx ./pres.numbers ## generates pres.numbers.xlsx
```
:::note pass
The command-line tool supports a number of formats including XLSB (`--xlsb`).
:::
The tools pair the SheetJS `readFile`[^3] and `writeFile`[^4] methods to read
data from arbitrary spreadsheet files and convert to XLSX:
```js
const XLSX = require("xlsx"); // load the SheetJS library
const wb = XLSX.readFile("input.xlsb"); // read input.xlsb
XLSX.writeFile(wb, "output.xlsx"); // export to output.xlsx
```
### MATLAB commands
The MATLAB `system` command[^5] can run command-line tools in M-files. For
example, if the `xlsx-cli` tool is placed in the workspace folder and the
test file `pres.numbers` is in the Downloads folder, the following command
generates the XLSX file `pres.numbers.xlsx` :
```matlab
% generate ~/Downloads/pres.numbers.xlsx from ~/Downloads/pres.numbers
system("./xlsx-cli --xlsx ~/Downloads/pres.numbers");
```
:::note pass
In an interactive session, the exclamation point operator `!`[^6] can be used:
```matlab
% generate ~/Downloads/pres.numbers.xlsx from ~/Downloads/pres.numbers
!./xlsx-cli --xlsx ~/Downloads/pres.numbers
```
:::
### Reading Files
Starting from an arbitrary spreadsheet, `xlsx-cli` can generate a XLSX workbook.
Once the workbook is written, the XLSX file can be parsed with `readtable`:
```matlab
% `filename` points to the file to be parsed
filename = "~/Downloads/pres.numbers";
% generate filename+".xlsx"
system("./xlsx-cli --xlsx " + filename)
% read using `readtable`
tbl = readtable(filename + ".xlsx");
```
The following diagram depicts the workbook waltz:
```mermaid
flowchart LR
subgraph MATLAB `system` invocation
file[(workbook\nunknown type)]
xlsx(XLSX\nNormalized Data)
end
data[(table)]
file --> |`xlsx-cli`\nSheetJS| xlsx
xlsx --> |`readtable`\nMATLAB| data
```
2024-12-22 04:47:57 +00:00
### Writing Files
2023-09-14 08:19:13 +00:00
Starting from an MATLAB table, `writetable` can generate a XLSX workbook. Once
the workbook is written, `xlsx-cli` can translate to NUMBERS or other formats:
```matlab
% tbl is the table
tbl = table({"Sheet";"JS"}, [72;62], 'VariableNames', ["Name", "Index"])
% `filename` points to the file to be written
filename = "~/Downloads/sorted.xlsx";
% write using `writetable`
writetable(tbl, filename);
% generate filename+".xlsb"
system("./xlsx-cli --xlsb " + filename);
```
The following diagram depicts the workbook waltz:
```mermaid
flowchart LR
subgraph MATLAB `system` invocation
file[(XLSB\nworkbook)]
xlsx(XLSX\nNormalized Data)
end
data[(table)]
data --> |`writetable`\nMATLAB| xlsx
xlsx --> |`xlsx-cli`\nSheetJS| file
```
## Complete Demo
2024-04-26 04:16:13 +00:00
This demo processes [`pres.numbers`](https://docs.sheetjs.com/pres.numbers).
2023-09-14 08:19:13 +00:00
There are 3 parts to the demo:
A) "Import": SheetJS tooling will read the test file and generate a clean XLSX
file. MATLAB will read the file using `readtable`.
B) "Process": Using `sortrows`, MATLAB will reverse the table order.
C) "Export": The modified table will be exported to XLSX using `writetable`.
SheetJS tooling will convert the file to XLSB.
```mermaid
flowchart LR
ifile[(NUMBERS)]
ixlsx(XLSX)
ofile[(XLSB)]
oxlsx(XLSX)
data[(table)]
ifile --> |`xlsx-cli`\nSheetJS| ixlsx
ixlsx --> |`readtable`\nMATLAB| data
data -.-> |Data Processing| data
data --> |`writetable`\nMATLAB| oxlsx
oxlsx --> |`xlsx-cli`\nSheetJS| ofile
```
2024-12-22 04:47:57 +00:00
0) Launch MATLAB and run the following command to print the workspace folder:
```matlab
pwd
```
This folder is typically `MATLAB` within the `Documents` folder for the account.
2024-12-22 04:47:57 +00:00
1) Open a new macOS Terminal or Windows PowerShell window.
2) Navigate to the workspace folder displayed in Step 0.
<Tabs groupId="os">
<TabItem value="win" label="Windows">
In Windows, the folder is typically `C:\Users\username\Documents\MATLAB`. Since
PowerShell sessions start from the user folder, the command is:
```bash
cd Documents\MATLAB
```
</TabItem>
<TabItem value="mac" label="macOS">
In macOS, the folder is typically `~/Documents/MATLAB` so the command is:
```bash
cd ~/Documents/MATLAB
```
</TabItem>
</Tabs>
1) Create the standalone `xlsx-cli` binary[^7]. The following commands should be
run in the macOS Terminal or Windows PowerShell:
2023-09-14 08:19:13 +00:00
<CodeBlock language="bash">{`\
npm i --save https://cdn.sheetjs.com/xlsx-${current}/xlsx-${current}.tgz exit-on-epipe commander@2
curl -LO https://docs.sheetjs.com/cli/xlsx-cli.js
npx nexe -t 14.15.3 xlsx-cli.js`}
</CodeBlock>
2024-12-22 04:47:57 +00:00
2) Download https://docs.sheetjs.com/pres.numbers to the workspace folder:
2023-09-14 08:19:13 +00:00
```bash
2024-12-22 04:47:57 +00:00
curl -LO https://docs.sheetjs.com/pres.numbers
2023-09-14 08:19:13 +00:00
```
2024-12-22 04:47:57 +00:00
4) Save the following to `SheetJSMATLAB.m` in the workspace folder:
2023-09-14 08:19:13 +00:00
2024-12-22 04:47:57 +00:00
<Tabs groupId="os">
<TabItem value="win" label="Windows">
```matlab title="SheetJSMATLAB.m"
% Import data from NUMBERS file
system(".\xlsx-cli.exe --xlsx pres.numbers");
tbl = readtable("pres.numbers.xlsx");
% Process data (reverse sort)
sorted = sortrows(tbl,"Index", "descend");
% Export data to XLSB workbook
writetable(sorted,"sorted.xlsx");
system(".\xlsx-cli.exe --xlsb sorted.xlsx");
2023-09-14 08:19:13 +00:00
```
2024-12-22 04:47:57 +00:00
</TabItem>
<TabItem value="mac" label="macOS">
2023-09-14 08:19:13 +00:00
```matlab title="SheetJSMATLAB.m"
% Import data from NUMBERS file
2024-12-22 04:47:57 +00:00
system("./xlsx-cli --xlsx pres.numbers");
tbl = readtable("pres.numbers.xlsx");
2023-09-14 08:19:13 +00:00
% Process data (reverse sort)
sorted = sortrows(tbl,"Index", "descend");
% Export data to XLSB workbook
2024-12-22 04:47:57 +00:00
writetable(sorted,"sorted.xlsx");
system("./xlsx-cli --xlsb sorted.xlsx");
2023-09-14 08:19:13 +00:00
```
2024-12-22 04:47:57 +00:00
</TabItem>
</Tabs>
2023-09-14 08:19:13 +00:00
5) In a MATLAB desktop session, run the `SheetJSMATLAB` command:
```matlab
>> SheetJSMATLAB
```
2024-12-22 04:47:57 +00:00
It will create the file `sorted.xlsx.xlsb` in the MATLAB workspace folder. Open
the file and confirm that the table is sorted by Index in descending order:
2023-09-14 08:19:13 +00:00
```
Name Index
Joseph Biden 46
Donald Trump 45
Barack Obama 44
GeorgeW Bush 43
Bill Clinton 42
```
:::tip pass
If the `matlab` command is available on the system `PATH`, the "headless"
version of the command is:
```bash
matlab -batch SheetJSMATLAB
```
:::
[^1]: See [`readtable`](https://www.mathworks.com/help/matlab/ref/readtable.html) in the MATLAB documentation.
[^2]: See [`writetable`](https://www.mathworks.com/help/matlab/ref/writetable.html) in the MATLAB documentation.
[^3]: See [`readFile` in "Reading Files"](/docs/api/parse-options)
[^4]: See [`writeFile` in "Writing Files"](/docs/api/write-options)
[^5]: See [`system`](https://www.mathworks.com/help/matlab/ref/system.html) in the MATLAB documentation.
[^6]: See ["MATLAB Operators and Special Characters](https://www.mathworks.com/help/matlab/matlab_prog/matlab-operators-and-special-characters.html) in the MATLAB documentation.
2024-03-18 08:24:41 +00:00
[^7]: See ["Command-line Tools"](/docs/demos/cli) for more details.