Skip to content

Commit d054cca

Browse files
authored
Merge pull request #9254 from PsiACE/weekly-72
docs(weekly): add this week in databend 72
2 parents 6f5006e + 54d6041 commit d054cca

File tree

1 file changed

+181
-0
lines changed

1 file changed

+181
-0
lines changed
Lines changed: 181 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,181 @@
1+
---
2+
title: 'This Week in Databend #72'
3+
date: 2022-12-14
4+
slug: 2022-12-14-databend-weekly
5+
tags: [databend, weekly]
6+
authors:
7+
- name: PsiACE
8+
url: https://github.com/psiace
9+
image_url: https://github.com/psiace.png
10+
---
11+
12+
[Databend](https://github.com/datafuselabs/databend) is a powerful cloud data warehouse. Built for elasticity and efficiency. Free and open. Also available in the cloud: <https://app.databend.com> .
13+
14+
## What's New
15+
16+
Check out what we've done this week to make Databend even better for you.
17+
18+
### Features & Improvements :sparkles:
19+
20+
**Multiple Catalogs**
21+
22+
- extends show databases SQL ([#9152](https://github.com/datafuselabs/databend/pull/9152))
23+
24+
**Stage**
25+
26+
- support select from URI ([#9247](https://github.com/datafuselabs/databend/pull/9247))
27+
28+
**Streaming Load**
29+
30+
- support `file_format` syntax in streaming load insert sql ([#9063](https://github.com/datafuselabs/databend/pull/9063))
31+
32+
**Planner**
33+
34+
- push down `limit` to `union` ([#9210](https://github.com/datafuselabs/databend/pull/9210))
35+
36+
**Query**
37+
38+
- use `analyze table` instead of `optimize table statistic` ([#9143](https://github.com/datafuselabs/databend/pull/9143))
39+
- fast parse insert values ([#9214](https://github.com/datafuselabs/databend/pull/9214))
40+
41+
**Storage**
42+
43+
- use distinct count calculated by the xor hash function ([#9159](https://github.com/datafuselabs/databend/pull/9159))
44+
- `read_parquet` read meta before read data ([#9154](https://github.com/datafuselabs/databend/pull/9154))
45+
- push down filter to parquet reader ([#9199](https://github.com/datafuselabs/databend/pull/9199))
46+
- prune row groups before reading ([#9228](https://github.com/datafuselabs/databend/pull/9228))
47+
48+
**Open Sharing**
49+
50+
- add prototype open sharing and add sharing stateful tests ([#9177](https://github.com/datafuselabs/databend/pull/9177))
51+
52+
### Code Refactoring :tada:
53+
54+
**\***
55+
56+
- simplify the global data registry logic ([#9187](https://github.com/datafuselabs/databend/pull/9187))
57+
58+
**Storage**
59+
60+
- refactor deletion ([#8824](https://github.com/datafuselabs/databend/pull/8824))
61+
62+
### Build/Testing/CI Infra Changes :electric_plug:
63+
64+
- release databend deb package and databend with hive ([#9138](https://github.com/datafuselabs/databend/pull/9138), [#9241](https://github.com/datafuselabs/databend/pull/9241), etc.)
65+
66+
### Bug Fixes :wrench:
67+
68+
**Format**
69+
70+
- support ASCII control code hex as format field delimiter ([#9160](https://github.com/datafuselabs/databend/pull/9160))
71+
72+
**Planner**
73+
74+
- prewhere_column empty and predicate is not const will return empty ([#9116](https://github.com/datafuselabs/databend/pull/9116))
75+
- don't push down topk to Merge when it's child is Aggregate ([#9183](https://github.com/datafuselabs/databend/pull/9183))
76+
- fix nullable column validity not equal ([#9220](https://github.com/datafuselabs/databend/pull/9220))
77+
78+
**Query**
79+
80+
- address unit test hang on test_insert ([#9242](https://github.com/datafuselabs/databend/pull/9242))
81+
82+
**Storage**
83+
84+
- too many io requests for read blocks during compact ([#9128](https://github.com/datafuselabs/databend/pull/9128))
85+
- collect orphan snapshots ([#9108](https://github.com/datafuselabs/databend/pull/9108))
86+
87+
## What's On In Databend
88+
89+
Stay connected with the latest news about Databend.
90+
91+
#### Breaking Change: Unified File Format Options
92+
93+
To simplify, we're rolling out a set of unified file format options as follows for the COPY INTO command, the Streaming Load API, and all the other cases where users need to describe their file formats:
94+
95+
```sql
96+
[ FILE_FORMAT = ( TYPE = { CSV | TSV | NDJSON | PARQUET | XML} [ formatTypeOptions ] ) ]
97+
```
98+
99+
- Please note that the current format options starting with `format_*` will be deprecated.
100+
- `... FORMAT CSV ...` will still be accepted by the ClickHouse handler.
101+
- Support for customized formats created by `CREATE FILE FORMAT ...` will be added in a future release: `... FILE_FORMAT = (format_name = 'MyCustomCSV') ....` .
102+
103+
**Learn More**
104+
105+
- [Issue | unify format options and remove format_ settings](https://github.com/datafuselabs/databend/issues/8995)
106+
- [PR | support file_format syntax in streaming load insert sql](https://github.com/datafuselabs/databend/pull/9063)
107+
108+
#### Open Sharing
109+
110+
Open Sharing is a simple and secure data-sharing protocol designed for databend-query nodes running in a multi-cloud environment.
111+
112+
- **Simple & Free**: Open Sharing is open-source and basically a RESTful API implementation.
113+
- **Secure**: Open Sharing verifies incoming requesters' identities and access permissions, and provides an audit log.
114+
- **Multi-Cloud**: Open Sharing supports a variety of public cloud platforms, including AWS, Azure, GCP, etc.
115+
116+
**Learn More**
117+
118+
- [Docs | Open Sharing](https://github.com/datafuselabs/databend/blob/main/src/query/sharing-endpoint/README.md)
119+
- [PR | add prototype open sharing and add sharing stateful tests](https://github.com/datafuselabs/databend/pull/9177)
120+
121+
## What's Up Next
122+
123+
We're always open to cutting-edge technologies and innovative ideas. You're more than welcome to join the community and bring them to Databend.
124+
125+
#### Refactoring Stage-Related Tests
126+
127+
We're about to run stage-related tests again using the Streaming Load API to move files to a stage instead of an AWS command like this:
128+
129+
```bash
130+
aws --endpoint-url ${STORAGE_S3_ENDPOINT_URL} s3 cp s3://testbucket/admin/data/ontime_200.csv s3://testbucket/admin/stage/internal/s1/ontime_200.csv >/dev/null 2>&1
131+
```
132+
133+
This is because Databend users do not need to take care of, or do not even know the stage paths that the AWS command requires.
134+
135+
[Issue 8528: refactor stage related tests](https://github.com/datafuselabs/databend/issues/8528)
136+
137+
Please let us know if you're interested in contributing to this issue, or pick up a good first issue at <https://link.databend.rs/i-m-feeling-lucky> to get started.
138+
139+
## Changelog
140+
141+
You can check the changelog of Databend Nightly for details about our latest developments.
142+
143+
- [v0.8.152-nightly](https://github.com/datafuselabs/databend/releases/tag/v0.8.152-nightly)
144+
- [v0.8.151-nightly](https://github.com/datafuselabs/databend/releases/tag/v0.8.151-nightly)
145+
- [v0.8.150-nightly](https://github.com/datafuselabs/databend/releases/tag/v0.8.150-nightly)
146+
- [v0.8.149-nightly](https://github.com/datafuselabs/databend/releases/tag/v0.8.149-nightly)
147+
- [v0.8.148-nightly](https://github.com/datafuselabs/databend/releases/tag/v0.8.148-nightly)
148+
- [v0.8.147-nightly](https://github.com/datafuselabs/databend/releases/tag/v0.8.147-nightly)
149+
- [v0.8.146-nightly](https://github.com/datafuselabs/databend/releases/tag/v0.8.146-nightly)
150+
- [v0.8.145-nightly](https://github.com/datafuselabs/databend/releases/tag/v0.8.145-nightly)
151+
152+
## Contributors
153+
154+
Thanks a lot to the contributors for their excellent work this week.
155+
156+
[<img alt="ariesdevil" src="https://avatars.githubusercontent.com/u/7812909?v=4&s=117" width="117" />](https://github.com/ariesdevil) |[<img alt="b41sh" src="https://avatars.githubusercontent.com/u/1070352?v=4&s=117" width="117" />](https://github.com/b41sh) |[<img alt="BohuTANG" src="https://avatars.githubusercontent.com/u/172204?v=4&s=117" width="117" />](https://github.com/BohuTANG) |[<img alt="Chasen-Zhang" src="https://avatars.githubusercontent.com/u/15354455?v=4&s=117" width="117" />](https://github.com/Chasen-Zhang) |[<img alt="ClSlaid" src="https://avatars.githubusercontent.com/u/44747719?v=4&s=117" width="117" />](https://github.com/ClSlaid) |[<img alt="dantengsky" src="https://avatars.githubusercontent.com/u/22081156?v=4&s=117" width="117" />](https://github.com/dantengsky) |
157+
:---: |:---: |:---: |:---: |:---: |:---: |
158+
[ariesdevil](https://github.com/ariesdevil) |[b41sh](https://github.com/b41sh) |[BohuTANG](https://github.com/BohuTANG) |[Chasen-Zhang](https://github.com/Chasen-Zhang) |[ClSlaid](https://github.com/ClSlaid) |[dantengsky](https://github.com/dantengsky) |
159+
160+
[<img alt="drmingdrmer" src="https://avatars.githubusercontent.com/u/44069?v=4&s=117" width="117" />](https://github.com/drmingdrmer) |[<img alt="hantmac" src="https://avatars.githubusercontent.com/u/7600925?v=4&s=117" width="117" />](https://github.com/hantmac) |[<img alt="lichuang" src="https://avatars.githubusercontent.com/u/1998569?v=4&s=117" width="117" />](https://github.com/lichuang) |[<img alt="mergify[bot]" src="https://avatars.githubusercontent.com/in/10562?v=4&s=117" width="117" />](https://github.com/apps/mergify) |[<img alt="PsiACE" src="https://avatars.githubusercontent.com/u/36896360?v=4&s=117" width="117" />](https://github.com/PsiACE) |[<img alt="RinChanNOWWW" src="https://avatars.githubusercontent.com/u/33975039?v=4&s=117" width="117" />](https://github.com/RinChanNOWWW) |
161+
:---: |:---: |:---: |:---: |:---: |:---: |
162+
[drmingdrmer](https://github.com/drmingdrmer) |[hantmac](https://github.com/hantmac) |[lichuang](https://github.com/lichuang) |[mergify[bot]](https://github.com/apps/mergify) |[PsiACE](https://github.com/PsiACE) |[RinChanNOWWW](https://github.com/RinChanNOWWW) |
163+
164+
[<img alt="soyeric128" src="https://avatars.githubusercontent.com/u/106025534?v=4&s=117" width="117" />](https://github.com/soyeric128) |[<img alt="sundy-li" src="https://avatars.githubusercontent.com/u/3325189?v=4&s=117" width="117" />](https://github.com/sundy-li) |[<img alt="wubx" src="https://avatars.githubusercontent.com/u/320680?v=4&s=117" width="117" />](https://github.com/wubx) |[<img alt="Xuanwo" src="https://avatars.githubusercontent.com/u/5351546?v=4&s=117" width="117" />](https://github.com/Xuanwo) |[<img alt="xudong963" src="https://avatars.githubusercontent.com/u/41979257?v=4&s=117" width="117" />](https://github.com/xudong963) |[<img alt="youngsofun" src="https://avatars.githubusercontent.com/u/5782159?v=4&s=117" width="117" />](https://github.com/youngsofun) |
165+
:---: |:---: |:---: |:---: |:---: |:---: |
166+
[soyeric128](https://github.com/soyeric128) |[sundy-li](https://github.com/sundy-li) |[wubx](https://github.com/wubx) |[Xuanwo](https://github.com/Xuanwo) |[xudong963](https://github.com/xudong963) |[youngsofun](https://github.com/youngsofun) |
167+
168+
[<img alt="ZhiHanZ" src="https://avatars.githubusercontent.com/u/25170437?v=4&s=117" width="117" />](https://github.com/ZhiHanZ) |[<img alt="zhyass" src="https://avatars.githubusercontent.com/u/34016424?v=4&s=117" width="117" />](https://github.com/zhyass) |[<img alt="zzzdong" src="https://avatars.githubusercontent.com/u/5125482?v=4&s=117" width="117" />](https://github.com/zzzdong) |
169+
:---: |:---: |:---: |
170+
[ZhiHanZ](https://github.com/ZhiHanZ) |[zhyass](https://github.com/zhyass) |[zzzdong](https://github.com/zzzdong) |
171+
172+
## Connect With Us
173+
174+
We'd love to hear from you. Feel free to run the code and see if Databend works for you. Submit an issue with your problem if you need help.
175+
176+
[DatafuseLabs Community](https://github.com/datafuselabs/) is open to everyone who loves data warehouses. Please join the community and share your thoughts.
177+
178+
- [Databend Official Website](https://databend.rs)
179+
- [GitHub Discussions](https://github.com/datafuselabs/databend/discussions) (Feature requests, bug reports, and contributions)
180+
- [Twitter](https://twitter.com/Datafuse_Labs) (Stay in the know)
181+
- [Slack Channel](https://link.databend.rs/join-slack) (Chat with the community)

0 commit comments

Comments
 (0)