public class TokenizerPreWhitespaceSplit extends Object implements TokenizerPre
| Constructor and Description |
|---|
TokenizerPreWhitespaceSplit(List<Integer> idCols,
int tokenizeCol,
org.apache.wink.json4j.JSONObject params) |
| Modifier and Type | Method and Description |
|---|---|
List<org.apache.sysds.runtime.transform.tokenize.Tokenizer.Token> |
splitToTokens(String text) |
List<org.apache.sysds.runtime.transform.tokenize.Tokenizer.DocumentToTokens> |
tokenizePre(FrameBlock in) |
public List<org.apache.sysds.runtime.transform.tokenize.Tokenizer.Token> splitToTokens(String text)
public List<org.apache.sysds.runtime.transform.tokenize.Tokenizer.DocumentToTokens> tokenizePre(FrameBlock in)
tokenizePre in interface TokenizerPreCopyright © 2021 The Apache Software Foundation. All rights reserved.