Finite-state chart constraints for reduced complexity context-free parsing pipelines

old_uid6542
titleFinite-state chart constraints for reduced complexity context-free parsing pipelines
start_date2009/03/19
schedule10h-12h
onlineno
location_infosalle 134
summaryThe cubic complexity of context-free inference limits the applicability of syntactic parsers when applications have either strong time constraints (e.g., real-time requirements) or deal with very large data sets. Reducing the complexity of these algorithms is key to leveraging syntactic information in such applications. In the first part of this talk, we consider classifying word positions by whether or not they can either start or end multi-word constituents. This provides a mechanism for "closing" chart cells during context-free inference, which is demonstrated to improve efficiency and accuracy when used to constrain the well-known Charniak parser. Additionally, we present a method for "closing" a sufficient number of chart cells to ensure quadratic worst-case complexity of context-free inference. Empirical results show that this O(n^2) bound can be achieved without impacting parsing accuracy. In the last part of this talk, we consider the impact of these constraints on exhaustive CYK parsing, and demonstrate that the quadratic complexity methods discussed earlier yield observed linear time performance. This suggests an alternate method for applying these constraints that can achieve linear or O(N log N) parsing pipeline complexity. Empirical results demonstrate the utility of the new methods.
responsibles<not specified>