---
title: Data Processing with GraalJS
sidebar_label: Java + GraalJS
pagination_prev: demos/bigdata/index
pagination_next: solutions/input
---
import current from '/version.js';
import Tabs from '@theme/Tabs';
import TabItem from '@theme/TabItem';
import CodeBlock from '@theme/CodeBlock';
[GraalJS](https://www.graalvm.org/latest/reference-manual/js/) is a JS engine
for Java. The project offers a JSR-223 compliant OpenJDK-compatible build.
[SheetJS](https://sheetjs.com) is a JavaScript library for reading and writing
data from spreadsheets.
The ["Complete Example"](#complete-example) section includes a complete Java
command-line tool for reading data from spreadsheets and printing CSV rows.
:::info pass
The Java packages used in this demo are covered under open source licenses. The
Universal Permissive License covers most of the packages, while `icu4j` uses
a different, yet still permissive, license.
:::
## Integration Details
:::info pass
When this demo was last tested, raw byte arrays could not be passed to GraalJS.
**This is a limitation of the default GraalJS behavior.**
Instead, this demo uses Nashorn Compatibility Mode[^1] through a runtime flag:
```bash
java -Dpolyglot.js.nashorn-compat=true ...
```
:::
The [Nashorn demo](/docs/demos/engines/nashorn) code and explanation apply to
JSR-223 compatible `ScriptEngine` implementations, including GraalJS.
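As a rough illustration of the JSR-223 lookup, the following sketch lists the script engines visible on the classpath, reports whether the `polyglot.js.nashorn-compat` property was set, and evaluates a trivial expression if an engine is found. The class name `EngineLookupSketch` and the engine name `"javascript"` are assumptions for illustration; the sample program may look up the engine differently.

```java
import javax.script.ScriptEngine;
import javax.script.ScriptEngineFactory;
import javax.script.ScriptEngineManager;
import java.util.ArrayList;
import java.util.List;

/* hypothetical helper class, not part of the demo */
class EngineLookupSketch {
  /* names of every JSR-223 engine factory visible on the classpath */
  static List<String> engineNames() {
    List<String> out = new ArrayList<>();
    for (ScriptEngineFactory f : new ScriptEngineManager().getEngineFactories()) {
      out.add(f.getEngineName());
    }
    return out;
  }

  /* true when the JVM was started with -Dpolyglot.js.nashorn-compat=true */
  static boolean nashornCompatRequested() {
    return Boolean.getBoolean("polyglot.js.nashorn-compat");
  }

  public static void main(String[] args) throws Exception {
    System.out.println("Engines: " + engineNames());
    System.out.println("nashorn-compat requested: " + nashornCompatRequested());

    /* assumption: the engine is registered under the name "javascript" */
    ScriptEngine engine = new ScriptEngineManager().getEngineByName("javascript");
    if (engine == null) {
      /* getEngineByName returns null when no matching engine is available */
      System.out.println("No JavaScript engine found on the classpath");
      return;
    }
    System.out.println("Using " + engine.getFactory().getEngineName());
    System.out.println("1 + 2 = " + engine.eval("1 + 2"));
  }
}
```

Without the GraalJS archives on the classpath, recent JDKs ship no JavaScript engine, so the sketch prints the "not found" message instead of failing.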
## Complete Example
:::note Tested Deployments
This demo was tested in the following deployments:
| OpenJDK | GraalJS | Date |
|:--------|:--------|:-----------|
| 23.0.1 | 24.1.1 | 2024-12-17 |
| 22.0.2 | 24.1.1 | 2024-12-17 |
| 21.0.5 | 24.1.1 | 2024-12-17 |
| 20.0.2 | 24.1.1 | 2024-12-17 |
| 19.0.2 | 24.1.1 | 2024-12-17 |
| 18.0.2 | 24.1.1 | 2024-12-17 |
| 17.0.13 | 24.1.1 | 2024-12-17 |
:::
### Compilation
0) Download GraalJS and its dependencies:
```bash
curl -LO "https://repo1.maven.org/maven2/org/graalvm/js/js-scriptengine/24.1.1/js-scriptengine-24.1.1.jar"
curl -LO "https://repo1.maven.org/maven2/org/graalvm/js/js-language/24.1.1/js-language-24.1.1.jar"
curl -LO "https://repo1.maven.org/maven2/org/graalvm/polyglot/polyglot/24.1.1/polyglot-24.1.1.jar"
curl -LO "https://repo1.maven.org/maven2/org/graalvm/sdk/collections/24.1.1/collections-24.1.1.jar"
curl -LO "https://repo1.maven.org/maven2/org/graalvm/truffle/truffle-api/24.1.1/truffle-api-24.1.1.jar"
curl -LO "https://repo1.maven.org/maven2/org/graalvm/sdk/nativeimage/24.1.1/nativeimage-24.1.1.jar"
curl -LO "https://repo1.maven.org/maven2/org/graalvm/shadowed/icu4j/24.1.1/icu4j-24.1.1.jar"
curl -LO "https://repo1.maven.org/maven2/org/graalvm/regex/regex/24.1.1/regex-24.1.1.jar"
```
1) Download the SheetJS Standalone script, shim script and test file. Move all
three files to the project directory:
<ul>
<li><a href={`https://cdn.sheetjs.com/xlsx-${current}/package/dist/xlsx.full.min.js`}>xlsx.full.min.js</a></li>
<li><a href={`https://cdn.sheetjs.com/xlsx-${current}/package/dist/shim.min.js`}>shim.min.js</a></li>
<li><a href="https://docs.sheetjs.com/pres.xlsx">pres.xlsx</a></li>
</ul>
<CodeBlock language="bash">{`\
curl -LO https://cdn.sheetjs.com/xlsx-${current}/package/dist/xlsx.full.min.js
curl -LO https://cdn.sheetjs.com/xlsx-${current}/package/dist/shim.min.js
curl -LO https://docs.sheetjs.com/pres.xlsx`}
</CodeBlock>
2) Download [`SheetJSNashorn.java`](pathname:///nashorn/SheetJSNashorn.java):
```bash
curl -LO https://docs.sheetjs.com/nashorn/SheetJSNashorn.java
```
3) Build the sample class:
```bash
javac SheetJSNashorn.java
```
This program tries to parse the file specified by the first argument and prints
CSV rows from the first worksheet.
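The download URLs in step 0 follow the standard Maven Central layout: the dots in the group ID become directory separators, followed by the artifact name, the version, and a file named `artifact-version.jar`. The sketch below rebuilds the eight URLs from that pattern, which may help when testing a different GraalJS release. The class name `MavenUrlSketch` and its helpers are hypothetical, and other releases may require a different set of archives.

```java
import java.util.ArrayList;
import java.util.List;

/* hypothetical helper class, not part of the demo */
class MavenUrlSketch {
  static final String BASE = "https://repo1.maven.org/maven2/";

  /* { groupId, artifactId } pairs for the eight archives from step 0 */
  static final String[][] ARTIFACTS = {
    { "org.graalvm.js", "js-scriptengine" },
    { "org.graalvm.js", "js-language" },
    { "org.graalvm.polyglot", "polyglot" },
    { "org.graalvm.sdk", "collections" },
    { "org.graalvm.truffle", "truffle-api" },
    { "org.graalvm.sdk", "nativeimage" },
    { "org.graalvm.shadowed", "icu4j" },
    { "org.graalvm.regex", "regex" },
  };

  /* group IDs map to directories: dots become slashes */
  static String jarUrl(String group, String artifact, String version) {
    return BASE + group.replace('.', '/') + "/" + artifact + "/" + version
      + "/" + artifact + "-" + version + ".jar";
  }

  /* one URL per archive, in the same order as step 0 */
  static List<String> urls(String version) {
    List<String> out = new ArrayList<>();
    for (String[] ga : ARTIFACTS) out.add(jarUrl(ga[0], ga[1], version));
    return out;
  }

  public static void main(String[] args) {
    /* 24.1.1 is the version used in this demo */
    String version = args.length > 0 ? args[0] : "24.1.1";
    for (String url : urls(version)) System.out.println(url);
  }
}
```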
### Standalone Test
4) Run the command directly:
<Tabs groupId="os">
<TabItem value="unix" label="Linux/MacOS">

```bash
java -cp ".:js-scriptengine-24.1.1.jar:js-language-24.1.1.jar:polyglot-24.1.1.jar:collections-24.1.1.jar:truffle-api-24.1.1.jar:nativeimage-24.1.1.jar:icu4j-24.1.1.jar:regex-24.1.1.jar" -Dpolyglot.js.nashorn-compat=true SheetJSNashorn pres.xlsx
```

</TabItem>
<TabItem value="win" label="Windows">

```bash
java -cp ".;js-scriptengine-24.1.1.jar;js-language-24.1.1.jar;polyglot-24.1.1.jar;collections-24.1.1.jar;truffle-api-24.1.1.jar;nativeimage-24.1.1.jar;icu4j-24.1.1.jar;regex-24.1.1.jar" -D"polyglot.js.nashorn-compat=true" SheetJSNashorn pres.xlsx
```

</TabItem>
</Tabs>
If successful, CSV rows from the first worksheet will be displayed.
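The two commands differ mainly in the classpath separator: `:` on Linux and macOS, `;` on Windows. Java exposes the platform value as `File.pathSeparator`. As a small sketch, the following hypothetical helper (`ClasspathSketch` is not part of the demo) assembles the same classpath string for a given version and separator:

```java
import java.io.File;
import java.util.ArrayList;
import java.util.List;

/* hypothetical helper class, not part of the demo */
class ClasspathSketch {
  /* archive base names, in the order used by the commands above */
  static final String[] ARTIFACTS = {
    "js-scriptengine", "js-language", "polyglot", "collections",
    "truffle-api", "nativeimage", "icu4j", "regex"
  };

  /* join "." and every versioned archive name with the given separator */
  static String classpath(String version, String separator) {
    List<String> entries = new ArrayList<>();
    entries.add(".");
    for (String a : ARTIFACTS) entries.add(a + "-" + version + ".jar");
    return String.join(separator, entries);
  }

  public static void main(String[] args) {
    /* File.pathSeparator is ":" on Linux and macOS and ";" on Windows */
    System.out.println(classpath("24.1.1", File.pathSeparator));
  }
}
```

The printed value can be passed to `java -cp` in place of the hand-written list.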
### Java Archive Test
5) Assemble a Java Archive:
```bash
jar -cf SheetJSNashorn.jar SheetJSNashorn.class xlsx.full.min.js shim.min.js
```
6) Create a new directory and copy the archives and test file:
```bash
mkdir -p sheethorn
cp *.jar pres.xlsx sheethorn
cd sheethorn
```
7) Run the program using the Java Archive:
<Tabs groupId="os">
<TabItem value="unix" label="Linux/MacOS">

```bash
java -cp ".:js-scriptengine-24.1.1.jar:js-language-24.1.1.jar:polyglot-24.1.1.jar:collections-24.1.1.jar:truffle-api-24.1.1.jar:nativeimage-24.1.1.jar:icu4j-24.1.1.jar:regex-24.1.1.jar:SheetJSNashorn.jar" -Dpolyglot.js.nashorn-compat=true SheetJSNashorn pres.xlsx
```

</TabItem>
<TabItem value="win" label="Windows">

```bash
java -cp ".;js-scriptengine-24.1.1.jar;js-language-24.1.1.jar;polyglot-24.1.1.jar;collections-24.1.1.jar;truffle-api-24.1.1.jar;nativeimage-24.1.1.jar;icu4j-24.1.1.jar;regex-24.1.1.jar;SheetJSNashorn.jar" -D"polyglot.js.nashorn-compat=true" SheetJSNashorn pres.xlsx
```

</TabItem>
</Tabs>
This should print the same CSV rows as step 4.
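The new directory holds only the archives and the test file, so the two scripts are presumably resolved from `SheetJSNashorn.jar` on the classpath rather than from the working directory. One common way to read a script bundled in an archive is `Class#getResourceAsStream`. The sketch below is an assumption about how such loading could work, not a copy of the sample program, and `ResourceScriptSketch` is a hypothetical name:

```java
import java.io.IOException;
import java.io.InputStream;
import java.io.UncheckedIOException;
import java.nio.charset.StandardCharsets;
import java.util.Optional;

/* hypothetical helper class, not part of the demo */
class ResourceScriptSketch {
  /* read a classpath resource as UTF-8 text; empty when the resource is missing */
  static Optional<String> readResource(String path) {
    try (InputStream in = ResourceScriptSketch.class.getResourceAsStream(path)) {
      if (in == null) return Optional.empty();
      return Optional.of(new String(in.readAllBytes(), StandardCharsets.UTF_8));
    } catch (IOException e) {
      throw new UncheckedIOException(e);
    }
  }

  public static void main(String[] args) {
    /* a leading slash resolves the name from the classpath root */
    for (String name : new String[] { "/shim.min.js", "/xlsx.full.min.js" }) {
      Optional<String> src = readResource(name);
      System.out.println(name + ": "
        + src.map(s -> s.length() + " characters").orElse("not found on classpath"));
    }
  }
}
```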
[^1]: See ["Nashorn Compatibility Mode"](https://www.graalvm.org/latest/reference-manual/js/NashornMigrationGuide/#nashorn-compatibility-mode) in the GraalJS documentation.