All projects
academic Chrome Extension

BISTAG

Bisaya-Tagalog Morphological Analyzer

A finite-state morphological analyzer for mixed Bisaya-Tagalog text, built using a Python-based Non-Deterministic Finite Automaton.

BISTAG

Overview

Code-switching is normal in the Philippines — people mix Bisaya and Tagalog mid-sentence without thinking about it. But most NLP tools don't know what to do with that. BISTAG tackles that directly: it performs morphological decomposition and code-switching detection on mixed Bisaya-Tagalog text using an NFA-based approach. The system handles tokenization, lemmatization, POS tagging, and morphological parsing — covering affixes, reduplication, and other patterns specific to Philippine languages. It's a chrome extension too, so it's usable outside of a notebook.

Gallery

BISTAG — view 1