文本分析项目-源码

上传者: 42116681 | 上传时间: 2021-02-25 10:02:38 | 文件大小: 148KB | 文件类型: ZIP
德语句子的自动复杂度评估 团队成员 里奥·阮·拉乌尔·贝格·康拉德·斯特劳布·蒂尔·诺彻 邮件地址 现有代码片段 利用的图书馆 运行代码(稍后将设置主入口点) 下载数据集: python download_data.py 项目状态 数据分析 我们的主要数据源是TextComplexityDE 19数据集( ),其中包含1000个德语句子,由外语学习者在7点Likert量表上标记为A级和B级,其中1表示低复杂度,高可读性句子,而7则相反。 其中900个句子来自23篇德国Wikipedia文章,其余100则来自Leichte Sprache。 数据集中的每个句子至少由5个人标记,数据集中提供了它们的平均评分。 除了复杂性/可读性之外,还收集了句子的可理解性和词汇难度得分。 图:饼图显示(四舍五入的)评级分布。 评级不是平均分配的,因为平均没有句子收到7,而很少有人得到6。在句子的

文件下载

资源详情

[{"title":"( 56 个子文件 148KB ) 文本分析项目-源码","children":[{"title":"text-analytics-project-master","children":[{"title":".example.env <span style='color:#111;'> 26B </span>","children":null,"spread":false},{"title":".github","children":[{"title":"workflows","children":[{"title":"test.yml <span style='color:#111;'> 606B </span>","children":null,"spread":false},{"title":"lint.yml <span style='color:#111;'> 345B </span>","children":null,"spread":false}],"spread":true}],"spread":true},{"title":"src","children":[{"title":"utils","children":[{"title":"sample.py <span style='color:#111;'> 855B </span>","children":null,"spread":false},{"title":"regression.py <span style='color:#111;'> 1.81KB </span>","children":null,"spread":false},{"title":"dimension_reduction.py <span style='color:#111;'> 1001B </span>","children":null,"spread":false},{"title":"wordlists.py <span style='color:#111;'> 9.32KB </span>","children":null,"spread":false},{"title":"experiments.py <span style='color:#111;'> 2.17KB </span>","children":null,"spread":false},{"title":"sentencestats.py <span style='color:#111;'> 8.34KB </span>","children":null,"spread":false},{"title":"preprocessing.py <span style='color:#111;'> 2.66KB </span>","children":null,"spread":false},{"title":"evaluater.py <span style='color:#111;'> 7.51KB </span>","children":null,"spread":false},{"title":"__init__.py <span style='color:#111;'> 0B </span>","children":null,"spread":false},{"title":"vectorizer.py <span style='color:#111;'> 6.32KB </span>","children":null,"spread":false},{"title":"gpu.py <span style='color:#111;'> 414B </span>","children":null,"spread":false},{"title":"word2vec.py <span style='color:#111;'> 4.71KB </span>","children":null,"spread":false},{"title":"downloader.py <span style='color:#111;'> 3.93KB </span>","children":null,"spread":false},{"title":"to_dataframe.py <span style='color:#111;'> 16.09KB </span>","children":null,"spread":false},{"title":"visualizer.py <span style='color:#111;'> 7.51KB </span>","children":null,"spread":false},{"title":"traverser.py <span style='color:#111;'> 11.79KB </span>","children":null,"spread":false},{"title":"trainer.py <span style='color:#111;'> 7.05KB </span>","children":null,"spread":false},{"title":"BERT.py <span style='color:#111;'> 2.71KB </span>","children":null,"spread":false},{"title":"clustering.py <span style='color:#111;'> 3.54KB </span>","children":null,"spread":false}],"spread":false},{"title":"main.py <span style='color:#111;'> 4.45KB </span>","children":null,"spread":false},{"title":"data","children":[{"title":".gitkeep <span style='color:#111;'> 0B </span>","children":null,"spread":false}],"spread":true},{"title":"exploration.py <span style='color:#111;'> 23.13KB </span>","children":null,"spread":false}],"spread":true},{"title":".vscode","children":[{"title":"extensions.json <span style='color:#111;'> 158B </span>","children":null,"spread":false},{"title":"settings.json <span style='color:#111;'> 1.51KB </span>","children":null,"spread":false}],"spread":true},{"title":"requirements.txt <span style='color:#111;'> 296B </span>","children":null,"spread":false},{"title":"Pipfile.lock <span style='color:#111;'> 104.78KB </span>","children":null,"spread":false},{"title":".idea","children":[{"title":"misc.xml <span style='color:#111;'> 292B </span>","children":null,"spread":false},{"title":"workspace.xml <span style='color:#111;'> 2.97KB </span>","children":null,"spread":false},{"title":"inspectionProfiles","children":[{"title":"profiles_settings.xml <span style='color:#111;'> 174B </span>","children":null,"spread":false}],"spread":true},{"title":"modules.xml <span style='color:#111;'> 296B </span>","children":null,"spread":false},{"title":"text-analytics-project.iml <span style='color:#111;'> 474B </span>","children":null,"spread":false},{"title":"vcs.xml <span style='color:#111;'> 180B </span>","children":null,"spread":false}],"spread":true},{"title":".isort.cfg <span style='color:#111;'> 52B </span>","children":null,"spread":false},{"title":"LICENSE <span style='color:#111;'> 1.04KB </span>","children":null,"spread":false},{"title":"README.md <span style='color:#111;'> 7.55KB </span>","children":null,"spread":false},{"title":"Pipfile <span style='color:#111;'> 817B </span>","children":null,"spread":false},{"title":"htmlcov","children":[{"title":"jquery.ba-throttle-debounce.min.js <span style='color:#111;'> 731B </span>","children":null,"spread":false},{"title":"index.html <span style='color:#111;'> 3.14KB </span>","children":null,"spread":false},{"title":"jquery.hotkeys.js <span style='color:#111;'> 2.99KB </span>","children":null,"spread":false},{"title":"keybd_closed.png <span style='color:#111;'> 112B </span>","children":null,"spread":false},{"title":"tests_sample_test_py.html <span style='color:#111;'> 5.57KB </span>","children":null,"spread":false},{"title":"status.json <span style='color:#111;'> 543B </span>","children":null,"spread":false},{"title":"jquery.min.js <span style='color:#111;'> 93.54KB </span>","children":null,"spread":false},{"title":"jquery.tablesorter.min.js <span style='color:#111;'> 12.50KB </span>","children":null,"spread":false},{"title":"jquery.isonscreen.js <span style='color:#111;'> 1.47KB </span>","children":null,"spread":false},{"title":"coverage_html.js <span style='color:#111;'> 18.18KB </span>","children":null,"spread":false},{"title":"_venv_lib_python3_7_site-packages__virtualenv_py.html <span style='color:#111;'> 35.49KB </span>","children":null,"spread":false},{"title":"keybd_open.png <span style='color:#111;'> 112B </span>","children":null,"spread":false},{"title":"style.css <span style='color:#111;'> 11.41KB </span>","children":null,"spread":false}],"spread":false},{"title":".pre-commit-config.yaml <span style='color:#111;'> 952B </span>","children":null,"spread":false},{"title":"tests","children":[{"title":"sample_test.py <span style='color:#111;'> 310B </span>","children":null,"spread":false},{"title":"__init__.py <span style='color:#111;'> 0B </span>","children":null,"spread":false}],"spread":true},{"title":".gitignore <span style='color:#111;'> 413B </span>","children":null,"spread":false}],"spread":false}],"spread":true}]

评论信息

免责申明

【只为小站】的资源来自网友分享,仅供学习研究,请务必在下载后24小时内给予删除,不得用于其他任何用途,否则后果自负。基于互联网的特殊性,【只为小站】 无法对用户传输的作品、信息、内容的权属或合法性、合规性、真实性、科学性、完整权、有效性等进行实质审查;无论 【只为小站】 经营者是否已进行审查,用户均应自行承担因其传输的作品、信息、内容而可能或已经产生的侵权或权属纠纷等法律责任。
本站所有资源不代表本站的观点或立场,基于网友分享,根据中国法律《信息网络传播权保护条例》第二十二条之规定,若资源存在侵权或相关问题请联系本站客服人员,zhiweidada#qq.com,请把#换成@,本站将给予最大的支持与配合,做到及时反馈和处理。关于更多版权及免责申明参见 版权及免责申明