Fakerycoder
diff --git a/‎EXAMPLES.md
Lines changed: 3 additions & 3 deletions b/‎EXAMPLES.md
Lines changed: 3 additions & 3 deletions
diff --git a/‎README.md
Lines changed: 33 additions & 19 deletions b/‎README.md
Lines changed: 33 additions & 19 deletions
diff --git a/‎setup.py
Lines changed: 2 additions & 1 deletion b/‎setup.py
Lines changed: 2 additions & 1 deletion
diff --git a/‎supar/__init__.py
Lines changed: 9 additions & 12 deletions b/‎supar/__init__.py
Lines changed: 9 additions & 12 deletions
diff --git a/‎supar/cmds/cmd.py
Lines changed: 0 additions & 1 deletion b/‎supar/cmds/cmd.py
Lines changed: 0 additions & 1 deletion
diff --git a/‎supar/models/__init__.py
Lines changed: 1 addition & 3 deletions b/‎supar/models/__init__.py
Lines changed: 1 addition & 3 deletions
diff --git a/‎supar/modules/affine.py
Lines changed: 4 additions & 3 deletions b/‎supar/modules/affine.py
Lines changed: 4 additions & 3 deletions
diff --git a/‎supar/parsers/__init__.py
Lines changed: 0 additions & 2 deletions b/‎supar/parsers/__init__.py
Lines changed: 0 additions & 2 deletions
diff --git a/‎supar/parsers/con.py
Lines changed: 19 additions & 9 deletions b/‎supar/parsers/con.py
Lines changed: 19 additions & 9 deletions
@@ -15,7 +15,7 @@ $ python -u -m supar.cmds.biaffine_dep train -b -d 0 -c biaffine-dep-en -p model
     --dev ptb/dev.conllx  \
     --test ptb/test.conllx  \
     --embed glove.6B.100d.txt  \
-    --unk
+    --unk unk
 # crf2o
 $ python -u -m supar.cmds.crf2o_dep train -b -d 0 -c crf2o-dep-en -p model -f char  \
     --train ptb/train.conllx  \
@@ -30,7 +30,7 @@ The option `-c` controls where to load predefined configs, you can either specif
 For CRF models, you need to specify `--proj` to remove non-projective trees.
 Specifying `--mbr` to perform MBR decoding often leads to consistent improvement.
 
-The model finetuned on [`robert-large`](https://huggingface.co/roberta-large) achieves nearly state-of-the-art performance in English dependency parsing.
+The model trained by finetuning [`robert-large`](https://huggingface.co/roberta-large) achieves nearly state-of-the-art performance in English dependency parsing.
 Here we provide some recommended hyper-parameters (not the best, but good enough).
 You are allowed to set values of registered/unregistered parameters in bash to suppress default configs in the file.
 ```sh
@@ -46,7 +46,7 @@ $ python -u -m supar.cmds.biaffine_dep train -b -d 0 -c biaffine-dep-roberta-en
     --epochs=10  \
     --update-steps=4
 ```
-The pretrained multilingual model `biaffine-dep-xlmr` takes [`xlm-roberta-large`](https://huggingface.co/xlm-roberta-large) as backbone architecture and finetunes on it.
+The pretrained multilingual model `biaffine-dep-xlmr` takes [`xlm-roberta-large`](https://huggingface.co/xlm-roberta-large) as backbone architecture and finetunes it.
 The training command is as following:
 ```sh
 $ python -u -m supar.cmds.biaffine_dep train -b -d 0 -c biaffine-dep-xlmr -p model  \
 
@@ -45,7 +45,7 @@ All results are tested on the machine with Intel(R) Xeon(R) CPU E5-2650 v4 @ 2.2
 
 English and Chinese dependency parsing models are trained on PTB and CTB7 respectively.
 For each parser, we provide pretrained models that take BiLSTM as encoder.
-We also provide models finetuned on pretrained language models from [Huggingface Transformers](https://github.com/huggingface/transformers).
+We also provide models trained by finetuning pretrained language models from [Huggingface Transformers](https://github.com/huggingface/transformers).
 We use [`robert-large`](https://huggingface.co/roberta-large) for English and [`hfl/chinese-electra-180g-large-discriminator`](https://huggingface.co/hfl/chinese-electra-180g-large-discriminator) for Chinese.
 During evaluation, punctuation is ignored in all metrics for PTB.
 
@@ -110,14 +110,14 @@ The results of each treebank are as follows.
 English semantic dependency parsing models are trained on [DM data introduced in SemEval-2014 task 8](https://catalog.ldc.upenn.edu/LDC2016T10), while Chinese models are trained on [NEWS domain data of corpora from SemEval-2016 Task 9](https://github.com/HIT-SCIR/SemEval-2016).
 Our data preprocessing steps follow [Second_Order_SDP](https://github.com/wangxinyu0922/Second_Order_SDP).
 
-| Name                      |   P   |   R   | F<sub>1 | Sents/s |
-| ------------------------- | :---: | :---: | :-----: | ------: |
-| `biaffine-sdp-en`         | 94.35 | 93.12 |  93.73  | 1067.06 |
-| `vi-sdp-en`               | 94.36 | 93.52 |  93.94  |  821.73 |
-| `biaffine-sdp-roberta-en` | 95.07 | 95.22 |  95.15  |  269.05 |
-| `biaffine-sdp-zh`         | 72.93 | 66.29 |  69.45  |  523.36 |
-| `vi-sdp-zh`               | 72.05 | 67.97 |  69.95  |  411.94 |
-| `biaffine-sdp-electra-zh` | 71.49 | 70.08 |  70.78  |  143.04 |
+| Name                |   P   |   R   | F<sub>1 | Sents/s |
+| ------------------- | :---: | :---: | :-----: | ------: |
+| `biaffine-sdp-en`   | 94.35 | 93.12 |  93.73  | 1067.06 |
+| `vi-sdp-en`         | 94.36 | 93.52 |  93.94  |  821.73 |
+| `vi-sdp-roberta-en` | 95.18 | 95.20 |  95.19  |  264.13 |
+| `biaffine-sdp-zh`   | 72.93 | 66.29 |  69.45  |  523.36 |
+| `vi-sdp-zh`         | 72.05 | 67.97 |  69.95  |  411.94 |
+| `vi-sdp-electra-zh` | 73.29 | 70.53 |  71.89  |  139.52 |
 
 ## Usage
 
@@ -152,12 +152,13 @@ probs: tensor([1.0000, 0.9999, 0.9966, 0.8944, 1.0000, 1.0000, 0.9999])
 ```
 
 `SuPar` also supports parsing from tokenized sentences or from file.
-For semantic dependency parsing, lemmas and POS tags are needed.
+For BiLSTM-based semantic dependency parsing models, lemmas and POS tags are needed.
 
 ```py
 >>> import os
 >>> import tempfile
->>> Parser.load('biaffine-dep-en').predict(['I','saw','Sarah','with','a','telescope','.'], verbose=False)[0]
+>>> dep = Parser.load('biaffine-dep-en')
+>>> dep.predict(['I', 'saw', 'Sarah', 'with', 'a', 'telescope', '.'], verbose=False)[0]
 1       I       _       _       _       _       2       nsubj   _       _
 2       saw     _       _       _       _       0       root    _       _
 3       Sarah   _       _       _       _       2       dobj    _       _
@@ -185,7 +186,7 @@ For semantic dependency parsing, lemmas and POS tags are needed.
 
 ''')
 ...
->>> Parser.load('biaffine-dep-en').predict(path, pred='pred.conllx', verbose=False)[0]
+>>> dep.predict(path, pred='pred.conllx', verbose=False)[0]
 # text = But I found the location wonderful and the neighbors very kind.
 1       But     _       _       _       _       3       cc      _       _
 2       I       _       _       _       _       3       nsubj   _       _
@@ -201,13 +202,26 @@ For semantic dependency parsing, lemmas and POS tags are needed.
 11      kind    _       _       _       _       6       conj    _       _
 12      .       _       _       _       _       3       punct   _       _
 
->>> Parser.load('crf-con-en').predict(['I','saw','Sarah','with','a','telescope','.'], verbose=False)[0]
-(TOP (S (NP (_ I)) (VP (_ saw) (NP (_ Sarah)) (PP (_ with) (NP (_ a) (_ telescope)))) (_ .)))
->>> Parser.load('biaffine-sdp-en').predict([[('I','I','PRP'), ('saw','see','VBD'),
-                                             ('Sarah','Sarah','NNP'), ('with','with','IN'),
-                                             ('a','a','DT'), ('telescope','telescope','NN'),
-                                             ('.','_','.')]],
-                                           verbose=False)[0]
+>>> con = Parser.load('crf-con-en')
+>>> con.predict(['I', 'saw', 'Sarah', 'with', 'a', 'telescope', '.'], verbose=False)[0].pretty_print()
+              TOP                       
+               |                         
+               S                        
+  _____________|______________________   
+ |             VP                     | 
+ |    _________|____                  |  
+ |   |    |         PP                | 
+ |   |    |     ____|___              |  
+ NP  |    NP   |        NP            | 
+ |   |    |    |     ___|______       |  
+ _   _    _    _    _          _      _ 
+ |   |    |    |    |          |      |  
+ I  saw Sarah with  a      telescope  . 
+
+>>> sdp = Parser.load('biaffine-sdp-en')
+>>> sdp.predict([[('I','I','PRP'), ('saw','see','VBD'), ('Sarah','Sarah','NNP'), ('with','with','IN'),
+                  ('a','a','DT'), ('telescope','telescope','NN'), ('.','_','.')]],
+                verbose=False)[0]
 1       I       I       PRP     _       _       _       _       2:ARG1  _
 2       saw     see     VBD     _       _       _       _       0:root|4:ARG1   _
 3       Sarah   Sarah   NNP     _       _       _       _       2:ARG2  _
 
@@ -4,7 +4,7 @@
 
 setup(
     name='supar',
-    version='1.1.0',
+    version='1.1.1',
     author='Yu Zhang',
     author_email='yzhang.cs@outlook.com',
     description='Syntactic/Semantic Parsing Models',
@@ -27,6 +27,7 @@
         'transformers>=4.0.0',
         'nltk',
         'stanza',
+        'opt_einsum',
         'dill'],
     entry_points={
         'console_scripts': [
 
@@ -4,7 +4,7 @@
                       BiaffineSemanticDependencyParser, CRF2oDependencyParser,
                       CRFConstituencyParser, CRFDependencyParser, Parser,
                       VIConstituencyParser, VIDependencyParser,
-                      VISemanticDependencyParser, VISemanticRoleLabelingParser)
+                      VISemanticDependencyParser)
 
 __all__ = ['BiaffineDependencyParser',
            'CRFDependencyParser',
@@ -14,10 +14,9 @@
            'VIConstituencyParser',
            'BiaffineSemanticDependencyParser',
            'VISemanticDependencyParser',
-           'VISemanticRoleLabelingParser',
            'Parser']
 
-__version__ = '1.1.0'
+__version__ = '1.1.1'
 
 PARSER = {parser.NAME: parser for parser in [BiaffineDependencyParser,
                                              CRFDependencyParser,
@@ -26,11 +25,10 @@
                                              CRFConstituencyParser,
                                              VIConstituencyParser,
                                              BiaffineSemanticDependencyParser,
-                                             VISemanticDependencyParser,
-                                             VISemanticRoleLabelingParser]}
-
-SRC = 'http://hlt.suda.edu.cn/LA/yzhang/supar'
+                                             VISemanticDependencyParser]}
 
+SRC = {'github': 'https://github.com/yzhangcs/parser/releases/download',
+       'hlt': 'http://hlt.suda.edu.cn/LA/yzhang/supar'}
 NAME = {
     'biaffine-dep-en': 'ptb.biaffine.dep.lstm.char',
     'biaffine-dep-zh': 'ctb7.biaffine.dep.lstm.char',
@@ -48,9 +46,8 @@
     'biaffine-sdp-zh': 'semeval16.biaffine.sdp.lstm.tag-char-lemma',
     'vi-sdp-en': 'dm.vi.sdp.lstm.tag-char-lemma',
     'vi-sdp-zh': 'semeval16.vi.sdp.lstm.tag-char-lemma',
-    'biaffine-sdp-roberta-en': 'dm.biaffine.sdp.roberta',
-    'biaffine-sdp-electra-zh': 'semeval16.biaffine.sdp.electra'
+    'vi-sdp-roberta-en': 'dm.vi.sdp.roberta',
+    'vi-sdp-electra-zh': 'semeval16.vi.sdp.electra'
 }
-
-MODEL = {n: f'{SRC}/v{__version__}/{m}.zip' for n, m in NAME.items()}
-CONFIG = {n: f'{SRC}/v{__version__}/{m}.ini' for n, m in NAME.items()}
+MODEL = {n: f"{SRC['github']}/v1.1.0/{m}.zip" for n, m in NAME.items()}
+CONFIG = {n: f"{SRC['github']}/v1.1.0/{m}.ini" for n, m in NAME.items()}
@@ -12,7 +12,6 @@ def parse(parser):
     parser.add_argument('--device', '-d', default='-1', help='ID of GPU to use')
     parser.add_argument('--seed', '-s', default=1, type=int, help='seed for generating random numbers')
     parser.add_argument('--threads', '-t', default=16, type=int, help='max num of threads')
-    parser.add_argument('--batch-size', default=5000, type=int, help='batch size')
     parser.add_argument("--local_rank", type=int, default=-1, help='node rank for distributed training')
     args, unknown = parser.parse_known_args()
     args, unknown = parser.parse_known_args(unknown, args)
 
@@ -5,7 +5,6 @@
                   CRFDependencyModel, VIDependencyModel)
 from .model import Model
 from .sdp import BiaffineSemanticDependencyModel, VISemanticDependencyModel
-from .srl import VISemanticRoleLabelingModel
 
 __all__ = ['Model',
            'BiaffineDependencyModel',
@@ -15,5 +14,4 @@
            'CRFConstituencyModel',
            'VIConstituencyModel',
            'BiaffineSemanticDependencyModel',
-           'VISemanticDependencyModel',
-           'VISemanticRoleLabelingModel']
+           'VISemanticDependencyModel']
@@ -2,6 +2,7 @@
 
 import torch
 import torch.nn as nn
+from opt_einsum import contract
 
 
 class Biaffine(nn.Module):
@@ -71,7 +72,7 @@ def forward(self, x, y):
         if self.bias_y:
             y = torch.cat((y, torch.ones_like(y[..., :1])), -1)
         # [batch_size, n_out, seq_len, seq_len]
-        s = torch.einsum('bxi,oij,byj->boxy', x, self.weight, y) / self.n_in ** self.scale
+        s = contract('bxi,oij,byj->boxy', x, self.weight, y) / self.n_in ** self.scale
         # remove dim 1 if n_out == 1
         s = s.squeeze(1)
 
@@ -145,9 +146,9 @@ def forward(self, x, y, z):
             x = torch.cat((x, torch.ones_like(x[..., :1])), -1)
         if self.bias_y:
             y = torch.cat((y, torch.ones_like(y[..., :1])), -1)
-        w = torch.einsum('bzk,oikj->bozij', z, self.weight)
+        w = contract('bzk,oikj->bozij', z, self.weight)
         # [batch_size, n_out, seq_len, seq_len, seq_len]
-        s = torch.einsum('bxi,bozij,byj->bozxy', x, w, y) / self.n_in ** self.scale
+        s = contract('bxi,bozij,byj->bozxy', x, w, y) / self.n_in ** self.scale
         # remove dim 1 if n_out == 1
         s = s.squeeze(1)
 
 
@@ -5,7 +5,6 @@
                   CRFDependencyParser, VIDependencyParser)
 from .parser import Parser
 from .sdp import BiaffineSemanticDependencyParser, VISemanticDependencyParser
-from .srl import VISemanticRoleLabelingParser
 
 __all__ = ['BiaffineDependencyParser',
            'CRFDependencyParser',
@@ -15,5 +14,4 @@
            'VIConstituencyParser',
            'BiaffineSemanticDependencyParser',
            'VISemanticDependencyParser',
-           'VISemanticRoleLabelingParser',
            'Parser']
@@ -7,7 +7,7 @@
 from supar.models import CRFConstituencyModel, VIConstituencyModel
 from supar.parsers.parser import Parser
 from supar.utils import Config, Dataset, Embedding
-from supar.utils.common import bos, eos, pad, unk
+from supar.utils.common import BOS, EOS, PAD, UNK
 from supar.utils.field import ChartField, Field, RawField, SubwordField
 from supar.utils.logging import get_logger, progress_bar
 from supar.utils.metric import SpanMetric
@@ -129,7 +129,7 @@ def predict(self, data, pred=None, lang=None, buckets=8, batch_size=5000, prob=F
         return super().predict(**Config().update(locals()))
 
     @classmethod
-    def load(cls, path, reload=False, **kwargs):
+    def load(cls, path, reload=False, src=None, **kwargs):
         r"""
         Loads a parser with data fields and pretrained model parameters.
 
@@ -140,6 +140,11 @@ def load(cls, path, reload=False, **kwargs):
                 - a local path to a pretrained model, e.g., ``./<path>/model``.
             reload (bool):
                 Whether to discard the existing cache and force a fresh download. Default: ``False``.
+            src (str):
+                Specifies where to download the model.
+                ``'github'``: github release page.
+                ``'hlt'``: hlt homepage, only accessible from 9:00 to 18:00 (UTC+8).
+                Default: None.
             kwargs (dict):
                 A dict holding unconsumed arguments for updating training configs and initializing the model.
 
@@ -149,7 +154,7 @@ def load(cls, path, reload=False, **kwargs):
             >>> parser = Parser.load('./ptb.crf.con.lstm.char')
         """
 
-        return super().load(path, reload, **kwargs)
+        return super().load(path, reload, src, **kwargs)
 
     def _train(self, loader):
         self.model.train()
@@ -246,7 +251,7 @@ def build(cls, path, min_freq=2, fix_len=20, **kwargs):
             return parser
 
         logger.info("Building the fields")
-        WORD = Field('words', pad=pad, unk=unk, bos=bos, eos=eos, lower=True)
+        WORD = Field('words', pad=PAD, unk=UNK, bos=BOS, eos=EOS, lower=True)
         TAG, CHAR, BERT = None, None, None
         if args.encoder != 'lstm':
             from transformers import (AutoTokenizer, GPT2Tokenizer,
@@ -262,11 +267,11 @@ def build(cls, path, min_freq=2, fix_len=20, **kwargs):
                                 fn=None if not isinstance(t, (GPT2Tokenizer, GPT2TokenizerFast)) else lambda x: ' '+x)
             WORD.vocab = t.get_vocab()
         else:
-            WORD = Field('words', pad=pad, unk=unk, bos=bos, eos=eos, lower=True)
+            WORD = Field('words', pad=PAD, unk=UNK, bos=BOS, eos=EOS, lower=True)
             if 'tag' in args.feat:
-                TAG = Field('tags', bos=bos, eos=eos)
+                TAG = Field('tags', bos=BOS, eos=EOS)
             if 'char' in args.feat:
-                CHAR = SubwordField('chars', pad=pad, unk=unk, bos=bos, eos=eos, fix_len=args.fix_len)
+                CHAR = SubwordField('chars', pad=PAD, unk=UNK, bos=BOS, eos=EOS, fix_len=args.fix_len)
             if 'bert' in args.feat:
                 from transformers import (AutoTokenizer, GPT2Tokenizer,
                                           GPT2TokenizerFast)
@@ -411,7 +416,7 @@ def predict(self, data, pred=None, lang=None, buckets=8, batch_size=5000, prob=F
         return super().predict(**Config().update(locals()))
 
     @classmethod
-    def load(cls, path, reload=False, **kwargs):
+    def load(cls, path, reload=False, src=None, **kwargs):
         r"""
         Loads a parser with data fields and pretrained model parameters.
 
@@ -422,6 +427,11 @@ def load(cls, path, reload=False, **kwargs):
                 - a local path to a pretrained model, e.g., ``./<path>/model``.
             reload (bool):
                 Whether to discard the existing cache and force a fresh download. Default: ``False``.
+            src (str):
+                Specifies where to download the model.
+                ``'github'``: github release page.
+                ``'hlt'``: hlt homepage, only accessible from 9:00 to 18:00 (UTC+8).
+                Default: None.
             kwargs (dict):
                 A dict holding unconsumed arguments for updating training configs and initializing the model.
 
@@ -431,7 +441,7 @@ def load(cls, path, reload=False, **kwargs):
             >>> parser = Parser.load('./ptb.vi.con.lstm.char')
         """
 
-        return super().load(path, reload, **kwargs)
+        return super().load(path, reload, src, **kwargs)
 
     def _train(self, loader):
         self.model.train()