Skip to content

Commit 737539e

Browse files
committed
WIP
1 parent 244cdb0 commit 737539e

File tree

1 file changed

+37
-1
lines changed

1 file changed

+37
-1
lines changed

_posts/2024-05-28-pyos-newsletter-june-2024.md

Lines changed: 37 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -22,12 +22,19 @@ We feel so incredibly fortunate to have connected with so many of you at PyCon U
2222
</a>
2323
</figure>
2424

25-
During PyCon US we had over 20 people contribute to pyOpenSci at our sprint, where contributors found bugs in our [beginner-friendly Python Packaging tutorials](https://www.pyopensci.org/python-package-guide/), fixed continuing integration issues, helped with our peer review process infrastructure, and even improved our packaging guide! Keep an eye out for more in an upcoming blog post from our Executive Director and Founder, Leah Wasser!
25+
### pyOpenSci's PyCon US Open Space
26+
This year, pyOpenSci's first Open Space brought together 17 attendees from various organizations, including Anaconda, NVIDIA, CPython, and more. The discussions focused on interoperability, scientific data workflows, and Python packaging for science, emphasizing the need for consistent tools and data formats to reduce cognitive load for scientists. A key topic was the gap between exploratory scripts and fully packaged code, highlighting the need for "in between" options for sharing code and outputs. These insights will guide pyOpenSci's efforts, with more discussions planned for the Fall Festival, scheduled for September 28-29, 2024.
27+
28+
### pyOpenSci's PyCon US Sprint
29+
This year we had a tremendous turnout of over 20 people from several countries for our 1-day pyConUS sprint. At last count this resulted in about [30 issues and pull requests](https://github.com/orgs/pyOpenSci/projects/12/views/1) where volunteers who were there to support scientists, helped us fix issues on our website, in our tutorials and our [packaging](https://www.pyopensci.org/python-package-guide/) and [peer review](https://www.pyopensci.org/software-peer-review/index.html) guidebooks. So many long-standing issues and bugs were fixed thanks to this wonderful Python community.
2630

2731
## <i class="fa-solid fa-handshake-simple"></i> See you at SciPy
2832
[SciPy 2024](https://www.scipy2024.scipy.org/) is just around the corner, and we can't wait to see you there! We're pulling together our pyOpenSci Guide to SciPy, similar to the guide we did for [PyCon 2024](https://www.pyopensci.org/blog/pyos-guide-to-pyconus-2024.html), and wanted to give you a preview of some of the tutorials and talks being given by pyOpenSci Community members!
2933

3034
### Tutorials
35+
#### [Interactive data visualizations with Bokeh (in 2024)](https://cfp.scipy.org/2024/talk/JRLMLD/)
36+
Along with [Timo Metzger](https://cfp.scipy.org/2024/speaker/HMRG3H/) and [Bryan Van de Ven](https://cfp.scipy.org/2024/speaker/BBLFMK/), pyOpenSci community member [Pavithra Eswaramoorthy]() will be covering everything you need to know to create beautiful and powerful interactive plots from scratch using Bokeh’s latest features. Starting with a quick introduction of Bokeh’s core concepts, the team will cover creating and customizing simple static plots like line and bar charts before introducing layers of interactivity, creating specialized plotting features like geographic maps, contour plots, Mathematical Text, and discussing new additions to Bokeh like ImageStacks. By the end, you will be able to create a complete interactive dashboard using Bokeh.
37+
3138
#### [3D Visualization with PyVista](https://cfp.scipy.org/2024/talk/GKGRWE/)
3239
Led by [Tetsuo Koyama](https://cfp.scipy.org/2024/speaker/WY7NA9/), [Alexander Kaszynski](https://cfp.scipy.org/2024/speaker/3RXHG8/), [Bill Little](https://cfp.scipy.org/2024/speaker/FDVWDN/), and [Bane Sullivan](https://cfp.scipy.org/2024/speaker/NEC33M/), this tutorial demonstrates [PyVista's](https://github.com/pyvista/pyvista) latest capabilities and bring a wide range of users to the forefront of 3D visualization in Python, including:
3340

@@ -37,13 +44,32 @@ Led by [Tetsuo Koyama](https://cfp.scipy.org/2024/speaker/WY7NA9/), [Alexander K
3744
* Know which Python libraries are used and can be used by PyVista (meshio, trimesh etc).
3845
* We see this tutorial catering to anyone who wants to visualize data in any domain, and this ranges from basic Python users to advanced power users.
3946

47+
#### [From RAGS to riches: Build an AI document inquiry web-app](https://cfp.scipy.org/2024/talk/W3ZJWG/)
48+
[Pavithra Eswaramoorthy](https://cfp.scipy.org/2024/speaker/QGMGFB/), a Developer Advocate at [Quansight](https://quansight.com/), is teaming up with [Dharhas Pothina](https://cfp.scipy.org/2024/speaker/EKHUEY/) and [Andrew Huang](https://cfp.scipy.org/2024/speaker/QSLXKW/) to cover how to use RAG to build document-inquiry chat systems using different commercial and locally running LLMs. The topics we’ll cover include:
49+
50+
* **Introduction to RAG**, how it works and interacts with LLMs, and Ragna - a framework for RAG orchestration
51+
* **Creating and optimizing a basic chat function** that uses popular LLMs (like GPT) answers questions about your documents, using a Python API in Jupyter Notebooks
52+
* **Running a local LLM on GPUs** on the provided platform, and comparing its performance to commercial LLMs
53+
* **Introduction to Panel**, by creating a basic chat UI for Ragna using Panel’s ChatBox widget
54+
* **Building and deploying a Panel-based web-app**, which extends the basic chat UI and includes more application components
55+
56+
By the end of this tutorial, you will have an understanding of the fundamental components that form a RAG model, and practical knowledge of open source tools that can help you or your organization explore and build on your own applications. This tutorial is designed to enable enthusiasts in our community to explore an interesting topic using some beginner-friendly Python libraries.
57+
4058
#### [Create Your First Python Package: Make Your Python Code Easier to Share and Use](https://cfp.scipy.org/2024/talk/QT9GBY/)
4159
Led by pyOpenSci's Executive Director and Founder, [Leah Wasser](https://github.com/lwasser), in this tutorial session you'll learn how to:
4260

4361
* Create code that can be installed into different environments
4462
* Use [Hatch](https://hatch.pypa.io/latest/) as a workflow tool, making setup and installation of your code easier
4563
* Use Hatch to publish your package to (test) PyPI
4664

65+
Creating code that can be shared and reused is the gold-standard of open science. But tools and skills to share your code can be tricky to learn. In this hands-on tutorial, you’ll learn how to turn your code into an installable Python module ( a file containing Python code that defines functions, classes, and variables), that can be shared with others. We will provide pre-built code and module examples for each step of the tutorial to make it more beginner-friendly. However, to get the most out of this tutorial, you should be familiar with writing Python code, writing and using functions and you should understand the concept of Python environments. You should also have Python and Hatch installed on your computer prior to attending. If you are newer to environments and don’t have a preferred tool, then we recommend that you use conda. However, any environment tool that you prefer works well.
66+
67+
Interested in helping out with our Packaging workshop? Reach out to us at [media@pyopensci.org](mailto:media@pyopensci.org). We're looking for volunteers to assist learners throughout the workshop.
68+
{ .notice}
69+
70+
#### [Data of an Unusual Size (2024 edition): A practical guide to analysis and interactive visualization of massive datasets](https://cfp.scipy.org/2024/talk/UKLNLQ/)
71+
[Pavithra Eswaramoorthy](https://cfp.scipy.org/2024/speaker/QGMGFB/), and [Dharhas Pothina](https://cfp.scipy.org/2024/speaker/EKHUEY/), both from [Quansight](https://quansight.com/), will help you learn the fundamentals of analyzing massive datasets with real-world examples on actual powerful machines on a public cloud provided by the presenters – starting from how the data is stored and read, to how it is processed and visualized.
72+
4773
### Talks
4874
#### [The power of community in solving scientific Python’s most challenging problems](https://cfp.scipy.org/2024/talk/AMTLJ7/)
4975
This talk from our Executive Director and Founder, [Leah Wasser](https://github.com/lwasser), shares pyOpenSci’s knowledge in building a constructive open peer review process that supports maintainers, and also resources that make Python packaging more accessible and easier-to-navigate. Our insights are based on five years of experience working with over 200 community members, who have evaluated 58 packages developed by over 56 package maintainers.
@@ -57,6 +83,16 @@ This talk from [Rich Iannone](https://cfp.scipy.org/2024/speaker/NJCPZ7/), whose
5783
#### [From Code to Clarity: Using Quarto for Python Documentation](https://cfp.scipy.org/2024/talk/GFGSTS/)
5884
[Isabel Zimmerman](https://cfp.scipy.org/2024/speaker/AVSVUJ/), a member of our triage team and an Emeritus Editor in Chief, will be speaking from her experiences as an open source developer who works with multilingual tools, and how Quarto + quartodoc helped to fill those gaps. This talk is intended for anyone who might have to communicate with stakeholders through code using a web-based format. She'll be focusing on Python package documentation websites, but the stories and examples expose the difficulties in many types of technical communication. The intent is to have a lighthearted talk filled with examples of how to make technical communication and publishing easier and more accessible.
5985

86+
#### [Vector space embeddings and data maps for cyber defense](https://cfp.scipy.org/2024/talk/STUXTH/)
87+
pyOpenSci community member [Benoit Hamelin](https://cfp.scipy.org/2024/speaker/SAQUZD/) will cover how the vast amounts of information of interest to cyber defense organizations comes in the form of unstructured data; from host-based telemetry and malware binaries, to phishing emails and network packet sequences. All of this data is extremely challenging to analyze. In recent years there have been huge advances in the methodology for converting unstructured media into vectors. However, leveraging such techniques for cyber defense data remains a challenge.
88+
89+
Imposing structure on unstructured data allows us to leverage powerful data science and machine learning tools. Structure can be imposed in multiple ways, but vector space representations, with a meaningful distance measure, have proven to be one of the most fruitful.
90+
91+
In this talk, he will demonstrate a number of techniques for embedding cyber defense data into vector spaces, as well as discuss how to leverage manifold learning techniques, clustering, and interactive data visualization to broaden our understanding of the data and enrich it with expert feedback.
92+
93+
### Sprints
94+
We'll also be holding a pyOpenSci sprint at SciPy 2024! Be sure to follow us on [LinkedIn](https://www.linkedin.com/company/pyopensci) and [Fosstodon](https://fosstodon.org/@pyOpenSci) for more details!
95+
6096
## <i class="fa-solid fa-envelope-open"></i> Save the Date: pyOpenSci's Open Source Fall Festival
6197
<figure>
6298
<a href="/images/blog/2024/may/oss-fall-festival.png">

0 commit comments

Comments
 (0)