机器学习开源工具集-MLOSS(Machine learning open source software)

+3 投票

之前shinchen同学提出“一起学习Mahout",大家反响得很强烈,甚至有同学建议建一个C++版的机器学习开源项目。本着“不重复造轮子”的原则,我google了一下“机器学习开源项目”,发现这方面国内总结的不多,倒是发现了一个国外的非常不错的机器学习开源工具集-MLOSS(Machine learning open source software),这个网站上目前已经收集了400多个开源的机器学习工具包,各种语言各种算法实现,对于机器学习或数据挖掘感兴趣的朋友来说,绝对是一个宝库。关于MLOSS的背景和目标,以下引用其官方网站的说明:

Background

Open source tools have recently reached a level of maturity which makes them suitable for building large-scale real-world systems. At the same time, the field of machine learning has developed a large body of powerful learning algorithms for a wide range of applications. Inspired by similar efforts in bioinformatics (BOSC) or statistics (useR), our aim is to build a forum for open source software in machine learning.

 

  • If you want more background about why open source software is important for machine learning, read our position paper about the need for open source software in machine learning.
  • If you have written machine learning software, consider adding it to the projects at mloss.org.
  • In case your machine learning software can be considered a useful, mature piece of work consider a submission to the JMLR track for machine learning open source software.

Goals

Our goal is to support a community creating a comprehensive open source machine learning environment. Ultimately, open source machine learning software should be able to compete with existing commercial closed source solutions. To this end, it is not enough to bring existing and freshly developed toolboxes and algorithmic implementations to people's attention. More importantly the MLOSS platform will facilitate collaborations with the goal of creating a set of tools that work with one another. Far from requiring integration into a single package, we believe that this kind of interoperability can also be achieved in a collaborative manner, which is especially suited to open source software development practices.

最后欢迎大家在我爱公开课上讨论和提供各种相关的开源工具包的信息,我们可以对相关的领域(包括但不限于机器学习,自然语言处理等)做一个汇总整理,方便大家的学习。

时间: 2012年 6月 29日 分类:开源项目 作者: 52opencourse (19,200 基本)
编辑 2012年 6月 30日 作者:52nlp

1个回答

0 投票

最好可以搞一系列源码剖析文章,或者介绍些成功的应用案例、经验与教训

已回复 2012年 6月 29日 作者: fandywang (2,360 基本)
NLPJob

TensorFlow Tutorial

Sentiment Analysis

Free Article Spinner

Text Analysis Online

Text Processing

本站架设在 DigitalOcean 上, 采用创作共用版权协议, 要求署名、非商业用途和保持一致. 转载本站内容必须也遵循“署名-非商业用途-保持一致”的创作共用协议.