text processing bahasa indonesia PHP
Bahasata is Example Text Processing for bahasa indonesia written in PHP.
for now only can stemmer and tokenizer
Bahasata dapat diinstall melalui Composer.
composer require muhfirdaus19/bahasata:dev-main
memisahkan kata, kalimat
use Bahasata\Bahasata;
// include autoloader
require './vendor/autoload.php';
$bahasata = new Bahasata();
$write = $bahasata->write('tetap bersama, jaga kesehatan!');
$result = $write->get();
// tetap bersama, jaga kesehatan!
$result = $write->wordsTokenizer()->get();
// ['tetap' ,'bersama' ,'jaga' ,'kesehatan']
$result = $write->sentencesTokenizer()->get();
// ['tetap bersama' ,'jaga kesehatan']
print_r($result);
mencari kata dasar dari sebuat kalimat/kata. contoh : memakan -> makan
use Bahasata\Bahasata;
// include autoloader
require './vendor/autoload.php';
$bahasata = new Bahasata();
$result = $bahasata->stem('merekomendasikan');
// rekomendasi
$write = $bahasata->write('saya rekomendasikan untuk memakan sayur');
$result = $write->wordsTokenizer()->stem()->get();
// ['saya', 'rekomendasi', 'untuk', 'makan', 'sayur']
print_r($result);
The muhfirdaus19/bahasata library is copyright © Muhammad Firdaus and licensed for use under the terms of the MIT License (MIT). Please see LICENSE for more information.