web/pandas/community/ecosystem.md
Lines changed: 6 additions & 157 deletions
@@ -151,20 +151,6 @@ or MATLAB, modified in a GUI, or embedded in apps and dashboards. Plotly
 is free for unlimited sharing, and has cloud, offline, or on-premise
 accounts for private use.
 
-### [Lux](https://github.com/lux-org/lux)
-
-Lux is a Python library that facilitates fast and easy experimentation with data by automating the visual data exploration process. To use Lux, simply add an extra import alongside pandas:
-
-```python
-import lux
-import pandas as pd
-
-df = pd.read_csv("data.csv")
-df # discover interesting insights!
-```
-
-By printing out a dataframe, Lux automatically [recommends a set of visualizations](https://raw.githubusercontent.com/lux-org/lux-resources/master/readme_img/demohighlight.gif) that highlights interesting trends and patterns in the dataframe. Users can leverage any existing pandas commands without modifying their code, while being able to visualize their pandas data structures (e.g., DataFrame, Series, Index) at the same time. Lux also offers a [powerful, intuitive language](https://lux-api.readthedocs.io/en/latest/source/guide/vis.html) that allow users to create Altair, matplotlib, or Vega-Lite visualizations without having to think at the level of code.
-
 ### [D-Tale](https://github.com/man-group/dtale)
 
 D-Tale is a lightweight web client for visualizing pandas data structures. It
@@ -386,92 +372,14 @@ Use `pandas_gbq.read_gbq` and `pandas_gbq.to_gbq`, instead.
-ArcticDB is a serverless DataFrame database engine designed for the Python Data Science ecosystem. ArcticDB enables you to store, retrieve, and process pandas DataFrames at scale. It is a storage engine designed for object storage and also supports local-disk storage using LMDB. ArcticDB requires zero additional infrastructure beyond a running Python environment and access to object storage and can be installed in seconds. Please find full documentation [here](https://docs.arcticdb.io/latest/).
-
-#### ArcticDB Terminology
-
-ArcticDB is structured to provide a scalable and efficient way to manage and retrieve DataFrames, organized into several key components:
-
--`Object Store` Collections of libraries. Used to separate logical environments from each other. Analogous to a database server.
--`Library` Contains multiple symbols which are grouped in a certain way (different users, markets, etc). Analogous to a database.
--`Symbol` Atomic unit of data storage. Identified by a string name. Data stored under a symbol strongly resembles a pandas DataFrame. Analogous to tables.
--`Version` Every modifying action (write, append, update) performed on a symbol creates a new version of that object.
-
-#### Installation
-
-To install, simply run:
-
-```console
-pip install arcticdb
-```
-
-To get started, we can import ArcticDB and instantiate it:
-
-```python
-import arcticdb as adb
-import numpy as np
-import pandas as pd
-# this will set up the storage using the local file system
-arctic = adb.Arctic("lmdb://arcticdb_test")
-```
-
-> **Note:** ArcticDB supports any S3 API compatible storage, including AWS. ArcticDB also supports Azure Blob storage.
-> ArcticDB also supports LMDB for local/file based storage - to use LMDB, pass an LMDB path as the URI: `adb.Arctic('lmdb://path/to/desired/database')`.
-
-#### Library Setup
-
-ArcticDB is geared towards storing many (potentially millions) of tables. Individual tables (DataFrames) are called symbols and are stored in collections called libraries. A single library can store many symbols. Libraries must first be initialized prior to use:
-Now we have a library set up, we can get to reading and writing data. ArcticDB has a set of simple functions for DataFrame storage. Let's write a DataFrame to storage.
-
-```python
-df = pd.DataFrame(
-    {
-        "a": list("abc"),
-        "b": list(range(1, 4)),
-        "c": np.arange(3, 6).astype("u1"),
-        "d": np.arange(4.0, 7.0, dtype="float64"),
-        "e": [True, False, True],
-        "f": pd.date_range("20130101", periods=3)
-    }
-)
-
-df
-df.dtypes
-```
-
-Write to ArcticDB.
-
-```python
-write_record = lib.write("test", df)
-```
-
-> **Note:** When writing pandas DataFrames, ArcticDB supports the following index types:
-> The "row" concept in `head`/`tail` refers to the row number ('iloc'), not the value in the `pandas.Index` ('loc').
+ArcticDB is a serverless DataFrame database engine designed for the Python Data Science ecosystem.
+ArcticDB enables you to store, retrieve, and process pandas DataFrames at scale.
+It is a storage engine designed for object storage and also supports local-disk storage using LMDB.
+ArcticDB requires zero additional infrastructure beyond a running Python environment and access
+to object storage and can be installed in seconds.
 
-#### Reading Data from ArcticDB
+Please find full documentation [here](https://docs.arcticdb.io/latest/).
 
-Read the data back from storage:
-
-```python
-read_record = lib.read("test")
-read_record.data
-df.dtypes
-```
-
-ArcticDB also supports appending, updating, and querying data from storage to a pandas DataFrame. Please find more information [here](https://docs.arcticdb.io/latest/api/processing/#arcticdb.QueryBuilder).
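
Among the removed lines is the note that the "row" concept in `head`/`tail` refers to row number ('iloc') rather than index value ('loc'). For anyone checking whether that note is still covered in the linked documentation, here is a minimal sketch of the distinction. It uses plain pandas only and makes no ArcticDB calls; the frame and its values are hypothetical and chosen so that index labels differ from row positions.

```python
import pandas as pd

# Hypothetical frame: the integer index labels deliberately differ from row positions.
df = pd.DataFrame({"value": [10, 20, 30]}, index=[2, 0, 1])

# Position-based lookup: the first row as stored, whatever its label is.
by_position = df["value"].iloc[0]

# Label-based lookup: the row whose index label equals 0.
by_label = df["value"].loc[0]

# pandas' own head() also counts rows by position, like iloc.
head_rows = df.head(1)

print(by_position, by_label, list(head_rows.index))
```

The two lookups return different rows here, which is the ambiguity the removed note guarded against.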