Commit ea6cb4c

docs(weekly): add this week in databend 72
1 parent 00f9f74 commit ea6cb4c

1 file changed

+187
-0
lines changed

@@ -0,0 +1,187 @@
---
title: 'This Week in Databend #72'
date: 2022-12-14
slug: 2022-12-14-databend-weekly
tags: [databend, weekly]
authors:
  - name: PsiACE
    url: https://github.com/psiace
    image_url: https://github.com/psiace.png
---
11+
12+
[Databend](https://github.com/datafuselabs/databend) is a powerful cloud data warehouse. Built for elasticity and efficiency. Free and open. Also available in the cloud: <https://app.databend.com>.
13+
14+
## What's New
15+
16+
Check out what we've done this week to make Databend even better for you.
17+
18+
### Features & Improvements :sparkles:
19+
20+
**Multiple Catalogs**
21+
22+
- extend `show databases` SQL ([#9152](https://github.com/datafuselabs/databend/pull/9152))
23+
24+
**Stage**
25+
26+
- support select from URI ([#9247](https://github.com/datafuselabs/databend/pull/9247))
27+
28+
**Streaming Load**
29+
30+
- support `file_format` syntax in streaming load insert sql ([#9063](https://github.com/datafuselabs/databend/pull/9063))
31+
32+
**Planner**
33+
34+
- push down `limit` to `union` ([#9210](https://github.com/datafuselabs/databend/pull/9210))
35+
36+
**Query**
37+
38+
- use `analyze table` instead of `optimize table statistic` ([#9143](https://github.com/datafuselabs/databend/pull/9143))
39+
- fast parse insert values ([#9214](https://github.com/datafuselabs/databend/pull/9214))
40+
41+
**Storage**
42+
43+
- use distinct count calculated by the xor hash function ([#9159](https://github.com/datafuselabs/databend/pull/9159))
44+
- `read_parquet` read meta before read data ([#9154](https://github.com/datafuselabs/databend/pull/9154))
45+
- push down filter to parquet reader ([#9199](https://github.com/datafuselabs/databend/pull/9199))
46+
- prune row groups before reading ([#9228](https://github.com/datafuselabs/databend/pull/9228))
47+
48+
**Open Sharing**
49+
50+
- add prototype open sharing and add sharing stateful tests ([#9177](https://github.com/datafuselabs/databend/pull/9177))
51+
52+
### Code Refactoring :tada:
53+
54+
**\***
55+
56+
- simplify the global data registry logic ([#9187](https://github.com/datafuselabs/databend/pull/9187))
57+
58+
**Storage**
59+
60+
- refactor deletion ([#8824](https://github.com/datafuselabs/databend/pull/8824))
61+
62+
### Build/Testing/CI Infra Changes :electric_plug:
63+
64+
- release databend deb package and databend with hive ([#9138](https://github.com/datafuselabs/databend/pull/9138), [#9241](https://github.com/datafuselabs/databend/pull/9241), etc.)
65+
66+
### Bug Fixes :wrench:
67+
68+
**Format**
69+
70+
- support ASCII control code hex as format field delimiter ([#9160](https://github.com/datafuselabs/databend/pull/9160))
71+
72+
**Planner**
73+
74+
- fix empty result returned when `prewhere_column` is empty and the predicate is not const ([#9116](https://github.com/datafuselabs/databend/pull/9116))
75+
- don't push down topk to Merge when its child is Aggregate ([#9183](https://github.com/datafuselabs/databend/pull/9183))
76+
- fix nullable column validity not equal ([#9220](https://github.com/datafuselabs/databend/pull/9220))
77+
78+
**Query**
79+
80+
- address unit test hang on test_insert ([#9242](https://github.com/datafuselabs/databend/pull/9242))
81+
82+
**Storage**
83+
84+
- too many io requests for read blocks during compact ([#9128](https://github.com/datafuselabs/databend/pull/9128))
85+
- collect orphan snapshots ([#9108](https://github.com/datafuselabs/databend/pull/9108))
86+
87+
## What's On In Databend
88+
89+
Stay connected with the latest news about Databend.
90+
91+
#### Unifying Format Options and Removing `format_*` Settings
92+
93+
> ***Breaking Change*** We plan to unify the formatting options and remove `format_*` settings in the future.
94+
> This means that we will **disable** the old format.
95+
> Work is in progress, and support for `FILE_FORMAT` in streaming load is now complete.
96+
97+
##### Proposal
98+
99+
The new implementation uses **only** the Snowflake style everywhere (insert, copy, and later "select from stage"):
100+
101+
- `FILE_FORMAT = (type = 'CSV' some_option = '|')`
102+
- `FILE_FORMAT = (format_name = 'MyCustomCSV')`
103+
104+
`FORMAT CSV` is retained for the ClickHouse handler.
105+
106+
`MyCustomCSV` is created using SQL like `CREATE FILE FORMAT`.
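
To make the proposed syntax concrete, here is a minimal dry-run sketch of a streaming load request that carries the format options inline. It only prints the `curl` command rather than sending it. The table name `ontime`, the `field_delimiter` option, the host, port and user, and the `insert_sql` header on `/v1/streaming_load` are all assumptions for illustration; check them against your Databend version.

```bash
#!/usr/bin/env bash
# Sketch: streaming load with the unified FILE_FORMAT clause (dry run).
# Assumptions: table `ontime`, option `field_delimiter`, host/port/user,
# and the `insert_sql` header of the `/v1/streaming_load` endpoint.

DATABEND_HTTP_URL="${DATABEND_HTTP_URL:-http://localhost:8000}"

# Build the INSERT statement with the format options inline.
streaming_insert_sql() {
  local table="$1" delimiter="$2"
  printf "insert into %s file_format = (type = 'CSV' field_delimiter = '%s')" \
    "$table" "$delimiter"
}

# Print the curl command that would stream file $2 into table $1.
streaming_load_cmd() {
  local table="$1" file="$2"
  printf 'curl -s -u root: -H "insert_sql: %s" -F "upload=@%s" -XPUT "%s/v1/streaming_load"\n' \
    "$(streaming_insert_sql "$table" '|')" "$file" "$DATABEND_HTTP_URL"
}

# Dry run: show the command instead of executing it.
streaming_load_cmd ontime ./ontime_200.csv
```

Printing the command keeps the sketch safe to run without a server; once the assumptions are verified, the printed line can be executed against a live instance.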
107+
108+
**Learn More**
109+
110+
- [Issue | unify format options and remove format_ settings](https://github.com/datafuselabs/databend/issues/8995)
111+
- [PR | support file_format syntax in streaming load insert sql](https://github.com/datafuselabs/databend/pull/9063)
112+
113+
#### Open Sharing
114+
115+
Open Sharing is a cheap and secure data sharing protocol for Databend queries in multi-cloud environments.
116+
117+
* **Cheap**: Open Sharing allows data sharing via a simple RESTful API sharing protocol, which is cheap and easy to understand.
118+
* **Secure**: The Open Sharing protocol verifies the identity and access permissions of incoming requesters and provides an audit log.
119+
* **Multi-cloud**: Open Sharing is designed to work with different cloud platforms, including AWS, Azure, GCP, etc.
120+
* **Open source**: Open Sharing is an open source project.
121+
122+
**Learn More**
123+
124+
- [Docs | Open Sharing](https://github.com/datafuselabs/databend/blob/main/src/query/sharing-endpoint/README.md)
125+
- [PR | add prototype open sharing and add sharing stateful tests](https://github.com/datafuselabs/databend/pull/9177)
126+
127+
## What's Up Next
128+
129+
We're always open to cutting-edge technologies and innovative ideas. You're more than welcome to join the community and bring them to Databend.
130+
131+
#### Refactoring Stage-Related Tests
132+
133+
We should use streaming upload to put files directly into the stage instead of writing them to S3, as the tests currently do:
134+
135+
```bash
aws --endpoint-url ${STORAGE_S3_ENDPOINT_URL} s3 cp s3://testbucket/admin/data/ontime_200.csv s3://testbucket/admin/stage/internal/s1/ontime_200.csv >/dev/null 2>&1
```
138+
139+
This is wrong: the stage's path rule should not be leaked into the tests.
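
One possible shape for the replacement is sketched below as a dry run that prints the upload command instead of executing it. It assumes an HTTP upload endpoint at `/v1/upload_to_stage` that takes the stage name in a `stage_name` header. The host, port and user are also assumptions, and the issue does not prescribe this exact form.

```bash
#!/usr/bin/env bash
# Sketch: upload a local file to stage `s1` through the HTTP handler,
# so the test never needs to know the stage's backing S3 path.
# Assumptions: endpoint path, `stage_name` header, host, port, and user.

DATABEND_HTTP_URL="${DATABEND_HTTP_URL:-http://localhost:8000}"

# Print the curl command that would upload file $2 into stage $1.
upload_to_stage_cmd() {
  local stage="$1" file="$2"
  printf 'curl -s -u root: -H "stage_name:%s" -F "upload=@%s" -XPUT "%s/v1/upload_to_stage"\n' \
    "$stage" "$file" "$DATABEND_HTTP_URL"
}

# Dry run: show the command instead of executing it.
upload_to_stage_cmd s1 ./ontime_200.csv
```

The test then refers to the file only by stage name, so a change to the internal path layout would not break it.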
140+
141+
[Issue 8528: refactor stage related tests](https://github.com/datafuselabs/databend/issues/8528)
142+
143+
Please let us know if you're interested in contributing to this issue, or pick up a good first issue at <https://link.databend.rs/i-m-feeling-lucky> to get started.
144+
145+
## Changelog
146+
147+
You can check the changelog of Databend Nightly for details about our latest developments.
148+
149+
- [v0.8.152-nightly](https://github.com/datafuselabs/databend/releases/tag/v0.8.152-nightly)
150+
- [v0.8.151-nightly](https://github.com/datafuselabs/databend/releases/tag/v0.8.151-nightly)
151+
- [v0.8.150-nightly](https://github.com/datafuselabs/databend/releases/tag/v0.8.150-nightly)
152+
- [v0.8.149-nightly](https://github.com/datafuselabs/databend/releases/tag/v0.8.149-nightly)
153+
- [v0.8.148-nightly](https://github.com/datafuselabs/databend/releases/tag/v0.8.148-nightly)
154+
- [v0.8.147-nightly](https://github.com/datafuselabs/databend/releases/tag/v0.8.147-nightly)
155+
- [v0.8.146-nightly](https://github.com/datafuselabs/databend/releases/tag/v0.8.146-nightly)
156+
- [v0.8.145-nightly](https://github.com/datafuselabs/databend/releases/tag/v0.8.145-nightly)
157+
158+
## Contributors
159+
160+
Thanks a lot to the contributors for their excellent work this week.
161+
162+
[<img alt="ariesdevil" src="https://avatars.githubusercontent.com/u/7812909?v=4&s=117" width="117" />](https://github.com/ariesdevil) |[<img alt="b41sh" src="https://avatars.githubusercontent.com/u/1070352?v=4&s=117" width="117" />](https://github.com/b41sh) |[<img alt="BohuTANG" src="https://avatars.githubusercontent.com/u/172204?v=4&s=117" width="117" />](https://github.com/BohuTANG) |[<img alt="Chasen-Zhang" src="https://avatars.githubusercontent.com/u/15354455?v=4&s=117" width="117" />](https://github.com/Chasen-Zhang) |[<img alt="ClSlaid" src="https://avatars.githubusercontent.com/u/44747719?v=4&s=117" width="117" />](https://github.com/ClSlaid) |[<img alt="dantengsky" src="https://avatars.githubusercontent.com/u/22081156?v=4&s=117" width="117" />](https://github.com/dantengsky) |
:---: |:---: |:---: |:---: |:---: |:---: |
[ariesdevil](https://github.com/ariesdevil) |[b41sh](https://github.com/b41sh) |[BohuTANG](https://github.com/BohuTANG) |[Chasen-Zhang](https://github.com/Chasen-Zhang) |[ClSlaid](https://github.com/ClSlaid) |[dantengsky](https://github.com/dantengsky) |
165+
166+
[<img alt="drmingdrmer" src="https://avatars.githubusercontent.com/u/44069?v=4&s=117" width="117" />](https://github.com/drmingdrmer) |[<img alt="hantmac" src="https://avatars.githubusercontent.com/u/7600925?v=4&s=117" width="117" />](https://github.com/hantmac) |[<img alt="lichuang" src="https://avatars.githubusercontent.com/u/1998569?v=4&s=117" width="117" />](https://github.com/lichuang) |[<img alt="mergify[bot]" src="https://avatars.githubusercontent.com/in/10562?v=4&s=117" width="117" />](https://github.com/apps/mergify) |[<img alt="PsiACE" src="https://avatars.githubusercontent.com/u/36896360?v=4&s=117" width="117" />](https://github.com/PsiACE) |[<img alt="RinChanNOWWW" src="https://avatars.githubusercontent.com/u/33975039?v=4&s=117" width="117" />](https://github.com/RinChanNOWWW) |
:---: |:---: |:---: |:---: |:---: |:---: |
[drmingdrmer](https://github.com/drmingdrmer) |[hantmac](https://github.com/hantmac) |[lichuang](https://github.com/lichuang) |[mergify[bot]](https://github.com/apps/mergify) |[PsiACE](https://github.com/PsiACE) |[RinChanNOWWW](https://github.com/RinChanNOWWW) |
169+
170+
[<img alt="soyeric128" src="https://avatars.githubusercontent.com/u/106025534?v=4&s=117" width="117" />](https://github.com/soyeric128) |[<img alt="sundy-li" src="https://avatars.githubusercontent.com/u/3325189?v=4&s=117" width="117" />](https://github.com/sundy-li) |[<img alt="wubx" src="https://avatars.githubusercontent.com/u/320680?v=4&s=117" width="117" />](https://github.com/wubx) |[<img alt="Xuanwo" src="https://avatars.githubusercontent.com/u/5351546?v=4&s=117" width="117" />](https://github.com/Xuanwo) |[<img alt="xudong963" src="https://avatars.githubusercontent.com/u/41979257?v=4&s=117" width="117" />](https://github.com/xudong963) |[<img alt="youngsofun" src="https://avatars.githubusercontent.com/u/5782159?v=4&s=117" width="117" />](https://github.com/youngsofun) |
:---: |:---: |:---: |:---: |:---: |:---: |
[soyeric128](https://github.com/soyeric128) |[sundy-li](https://github.com/sundy-li) |[wubx](https://github.com/wubx) |[Xuanwo](https://github.com/Xuanwo) |[xudong963](https://github.com/xudong963) |[youngsofun](https://github.com/youngsofun) |
173+
174+
[<img alt="ZhiHanZ" src="https://avatars.githubusercontent.com/u/25170437?v=4&s=117" width="117" />](https://github.com/ZhiHanZ) |[<img alt="zhyass" src="https://avatars.githubusercontent.com/u/34016424?v=4&s=117" width="117" />](https://github.com/zhyass) |[<img alt="zzzdong" src="https://avatars.githubusercontent.com/u/5125482?v=4&s=117" width="117" />](https://github.com/zzzdong) | | | |
:---: |:---: |:---: |:---: |:---: |:---: |
[ZhiHanZ](https://github.com/ZhiHanZ) |[zhyass](https://github.com/zhyass) |[zzzdong](https://github.com/zzzdong) | | | |
177+
178+
## Connect With Us
179+
180+
We'd love to hear from you. Feel free to run the code and see if Databend works for you. Submit an issue with your problem if you need help.
181+
182+
[DatafuseLabs Community](https://github.com/datafuselabs/) is open to everyone who loves data warehouses. Please join the community and share your thoughts.
183+
184+
- [Databend Official Website](https://databend.rs)
185+
- [GitHub Discussions](https://github.com/datafuselabs/databend/discussions) (Feature requests, bug reports, and contributions)
186+
- [Twitter](https://twitter.com/Datafuse_Labs) (Stay in the know)
187+
- [Slack Channel](https://link.databend.rs/join-slack) (Chat with the community)
