You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: src/zeroshot_engine.egg-info/PKG-INFO
+64-49Lines changed: 64 additions & 49 deletions
Original file line number
Diff line number
Diff line change
@@ -1,6 +1,6 @@
1
1
Metadata-Version: 2.4
2
2
Name: zeroshot-engine
3
-
Version: 0.1.2
3
+
Version: 0.1.3
4
4
Summary: A zero-shot classification engine based on various LLM models
5
5
Author-email: Lucas Schwarz <luc.schwarz@posteo.de>
6
6
License:
@@ -228,14 +228,24 @@ Dynamic: license-file
228
228
229
229
# zeroshot-engine
230
230
231
-
A scientific zero-shot text classification engine based on various LLM models.
231
+
A open-source scientific zero-shot text classification engine based on various LLM models.
232
232
233
-
## Description
233
+
## 📖 About this package
234
+
235
+
### Description
234
236
235
237
This project provides a flexible framework for performing zero-shot classification using large language models and pandas. It allows you to classify text into categories without requiring explicit training data for those categories. All instructions to LLMs are provided by mere natural language prompts. The framework is designed to support a wide range of text classification tasks including multi-label, multi-class, and single-class classification scenarios.
236
238
237
-
## Features
239
+
### Purpose
240
+
241
+
This package was developed as part of an academic research project to systematically classify political communication. The primary goal was to create an easy-to-use and accessible framework for building adaptable zero-shot classifications with large language models (LLMs) across a wide variety of text analysis tasks. By providing a flexible and intuitive tool, this project aims to empower students and researchers — especially those in social sciences — to explore, evaluate, and harness the potential of zero-shot classification while addressing its challenges in a user-friendly environment. I have no financial interest in this project.
242
+
243
+
### Open-Source and Non-Commercial
244
+
245
+
This project is fully open-source and was developed with no financial interests. It is intended to support academic research and the broader scientific community. Contributions are welcome to help improve the framework and expand its capabilities.
238
246
247
+
## ✨ Features
248
+
### Overview
239
249
* Handles multi-label, multi-class, and single-class classification tasks.
240
250
* Option for incorporating few-shot learning through the flexible prompt engineering approach.
@@ -251,46 +261,33 @@ This project provides a flexible framework for performing zero-shot classificati
251
261
* **Multi-Prompting:** The use of multiple different prompts for different tasks to elicit more comprehensive and reliable predictions from the model.
252
262
* **Modular Prompt Design:** While not automated in the current implementation, the modular prompt design with text blocks facilitates manual testing and refinement of prompts to improve classification accuracy.
253
263
264
+
### Core Modules
265
+
* **Iterative Double Validated Zero-Shot Classification (IDZSC)**: IDZSC is the core module to classify texts in an iterative process. It can use a double validation technique to ensure the robustness and accuracy of the classifications.
266
+
* **Hierarchical Double Validated Zero-Shot Classification (HDZSC)**: HDZSC extends the zero-shot classification capabilities to hierarchical category structures. It leverages a double validation approach to maintain accuracy while navigating the complexities of hierarchical classification.
254
267
255
-
## Installation
256
-
268
+
## 🚀 Get Started
269
+
### How to install
270
+
Install the `zeroshot-engine` package using pip in your Windows Powershell or Linux / Mac Bash Terminal.
257
271
```bash
258
272
pip install zeroshot-engine
259
273
```
260
274
261
-
##Demo
262
-
275
+
### Interactive Demo in the Command Line
276
+
Test the `zeroshot-engine` in the HDZSC-scenario by selecting from a wide variety of LLMs and bringing your own text for classification:
263
277
```bash
264
278
zeroshot-engine demo
265
279
```
280
+
This command will guide you through an interactive demo where you can:
IDZSC is the core module to classify texts in an iterative process. It can use a double validation technique to ensure the robustness and accuracy of the classifications.
282
+
* Choose an LLM model (e.g., one from OpenAI or Ollama).
283
+
* Provide your own text for classification or use a provided example text.
284
+
* Observe how the hierarchical classification process works in real-time.
### Run your first Zeroshot Classification Project in Python
280
287
281
-
HDZSC extends the zero-shot classification capabilities to hierarchical category structures. It leverages a double validation approach to maintain accuracy while navigating the complexities of hierarchical classification.
288
+
This **[tutorial](docs/Tutorial_Get_Started.md)** provides example code for your first test project, which you can use as a template to build and adapt your own research projects. For more detailed information and advanced usage, please refer to the documentation.
282
289
283
-
### Planned Features
284
-
285
-
* Highest Priority: Extend functionality, that user can use more than the demo.
286
-
* Improved documentation and examples.
287
-
* Create prompting guidelines.
288
-
* Better integration and testing of validation metrics.
289
-
* Automated Logging System
290
-
* Add contribution guidelines.
291
-
* Support for more LLMs and APIs.
292
-
293
-
### Documentation
290
+
## 📚 Documentation
294
291
For more detailed information about the framework and its implementation, please refer to the following documentation:
295
292
296
293
* **[Overview of IDZSC and HDZSC](docs/Overview_IDZSC_and_HDZSC.md)** - A comprehensive explanation of the Iterative and Hierarchical Double Zero-Shot Classification approaches, including detailed examples and usage patterns.
@@ -301,10 +298,8 @@ For more detailed information about the framework and its implementation, please
301
298
302
299
* **[Tutorial: Get started with your first classification](docs/Tutorial_Get_Started.md)** - Create your first projects with prompt, code examples and text to learn how to set up the classifer.
* Additional tutorial for double validation vs. zero-temp approach.
353
+
* Documentation of all relevant functions.
354
+
* Create prompting guidelines.
355
+
* Better integration and testing of validation metrics.
356
+
* Automated Logging System
357
+
* Add contribution guidelines.
358
+
* Support for more LLMs and APIs.
359
+
360
+
## 🚧 Notice: Under Development
361
+
362
+
> **Note:**
363
+
> While the core functionality of `zeroshot-engine` is already up and running, this project is still under active development.
364
+
> There may be bugs, incomplete features, or areas for improvement.
365
+
>
366
+
> If you encounter any issues, have feature requests, or would like to contribute code to the project, please feel free to:
367
+
> - Open an issue on the [GitHub repository](https://github.com/your-repo-link/issues).
368
+
> - Submit a pull request with your contributions.
369
+
> - Contact the author directly at **luc.schwarz@posteo.de**.
370
+
>
371
+
> Contributions are highly appreciated and will help improve the framework for the scientific community!
372
+
373
+
## 📜 License
356
374
357
375
This project is licensed under the Apache 2.0 License - see the [LICENSE](LICENSE) file for details.
358
376
359
-
## Contributing
377
+
## 🫱🏼🫲🏼 Contributing
360
378
361
-
We welcome contributions! Feel free to open issues for bug reports or feature requests. If you'd like to contribute code directly, please see our [contributing guidelines](CONTRIBUTING.md).
379
+
Contributions are welcome! Feel free to open issues for bug reports or feature requests. If you'd like to contribute code directly, please see the [contributing guidelines](CONTRIBUTING.md).
362
380
363
-
## Author
381
+
## 🤵 Author
364
382
365
383
Lucas Schwarz
366
384
367
-
## Contact
385
+
## 📧 Contact
368
386
369
387
luc.schwarz@posteo.de
370
388
371
-
## Citation
389
+
## 🏛️ Citation
372
390
373
391
If you use `zeroshot-engine` in your research, please cite it as follows:
374
392
375
-
> Schwarz, L. (2025) „zeroshot-engine: A scientific zero-shot text classification engine based on various LLM models“. Zenodo. doi: 10.5281/zenodo.15077242.
393
+
> Schwarz, L. (2025) „zeroshot-engine: A scientific zero-shot text classification engine based on various LLM models“. Zenodo. doi: 10.5281/zenodo.15079108.
376
394
377
395
```bibtex
378
-
@software{schwarz_2025_15077242,
396
+
@software{schwarz_2025_15079108,
379
397
author = {Schwarz, Lucas},
380
398
title = {zeroshot-engine: A scientific zero-shot text
0 commit comments