Apache Pig 0.10.1已经发布,这个版本并没有什么特性,更多的是为了稳定性以及一些bug的修正
显著变化:
1. 代码独立以及支持多种格式的下载.
下载地址: http://www.fightrice.com/mirrors/apache/pig/pig-0.10.1/
可以通过tar.gz,rpm,deb等方式下载获取
2. 更好的支持 Apache Hadoop 0.23.x/2.x
从0.10.0开始,pig就已经加入了Maven仓库.你可以通过以下方式配置pom.xml来使用pig0.10.1
<dependency> <groupId>org.apache.pig</groupId> <artifactId>pig</artifactId> <version>0.10.1</version> <classifier>h2</classifier> </dependency>
同时Pig团队修正了Pig运行在Hadoop 0.23X/2.X版本上的一些Bug( PIG-3035, PIG-2783, PIG-2761, PIG-2912, PIG-2791)
3.更好的支持Oracle JDK7.0
4.单元测试的错误修正
具体修改如下:
IMPROVEMENTS
PIG-2907: Publish pig jars for Hadoop2/23 to maven (rohini)
PIG-3019: Need a target in build.xml for source releases (gates)
PIG-2794: Pig test: add utils to simplify testing on Windows (jgordon via gates)
PIG-2908: Fix unit tests to work with jdk7 (rohini via dvryaboy)
PIG-2852: Update documentation regarding parallel local mode execution (cheolsoo via jcoveney)
PIG-2712: Pig does not call OutputCommitter.abortJob() on the underlying OutputFormat (rohini via gates)
PIG-2727: PigStorage Source tagging does not need pig.splitCombination to be turned off (prkommireddi via dvryaboy)
PIG-2711: e2e harness: cache benchmark results between test runs (thw via daijy)
PIG-2680: TOBAG output schema reporting (andy schlaikjer via jcoveney)
PIG-2650: Convenience mock Loader and Storer to simplify unit testing of Pig scripts (julien)
BUG FIXES
PIG-3107: bin and autocomplete are missing in src release (daijy)
PIG-3106: Missing license header in several java file (daijy)
PIG-3099: Pig unit test fixes for TestGrunt(1), TestStore(2), TestEmptyInputDir(3) (vikram.dixit via daijy)
PIG-3035: With latest version of hadoop23 pig does not return the correct exception stack trace from backend (rohini)
PIG-2953: "which" utility does not exist on Windows (daijy)
PIG-2960: Increase the timeout for unit test (daijy)
PIG-2958: Pig tests do not appear to have a logger attached (daijy)
PIG-2942: DevTests, TestLoad has a false failure on Windows (jgordon via daijy)
PIG-2943: DevTests, Refactor Windows checks to use new Util.WINDOWS method for code health (jgordon via dvryaboy)
PIG-2801: grunt "sh" command should invoke the shell implicitly instead of calling exec directly with the command tokens
(jgordon via daijy)
PIG-2800: pig.additional.jars path separator should align with File.pathSeparator instead of being hard-coded to ":"
(jgordon via azaroth)
PIG-2798: pig streaming tests assume interpreters are auto-resolved (jgordon via daijy)
PIG-2797: Tests should not create their own file URIs through string concatenation, should use Util.generateURI
instead (jgordon via daijy)
PIG-2796: Local temporary paths are not always valid HDFS path names (jgordon via daijy)
PIG-2795: Fix test cases that generate pig scripts with "load " + pathStr to encode "\" in the path (jgordon via daijy)
PIG-2940: HBaseStorage store fails in secure cluster (cheolsoo via daijy)
PIG-2821: HBaseStorage should work with secure hbase (rohini via daijy)
PIG-2890: Revert PIG-2578 (dvryaboy)
PIG-2859: Fix few e2e test failures (rohini via daijy)
PIG-2729: Macro expansion does not use pig.import.search.path - UnitTest borked (johannesch via daijy)
PIG-2791: Pig does not work with Namenode Federation (rohini via daijy)
PIG-2783: Fix Iterator_1 e2e test for Hadoop 23 (rohini via daijy)
PIG-2761: With hadoop23 importing modules inside python script does not work (rohini via daijy)
PIG-2759: Typo in document "Built In Functions" (daijy)
PIG-2745: Pig e2e test RubyUDFs fails in MR mode when running from tarball (cheolsoo via daijy)
PIG-2741: Python script throws an NameError: name 'Configuration' is not defined in case cache dir is not created
(knoguchi via daijy)
PIG-2669: Pig release should include pig-default.properties after rebuild (daijy)
PIG-2739: PyList should map to Bag automatically in Jython (daijy)
PIG-2730: TFileStorage getStatistics incorrectly throws an exception instead of returning null (traviscrawford via daijy)
PIG-2717: Tuple field mangled during flattening (daijy)
PIG-2721: Wrong output generated while loading bags as input (knoguchi via daijy)
PIG-2912: Pig should clone JobConf while creating JobContextImpl and TaskAttemptContextImpl in Hadoop23 (rohini via daijy)
PIG-2775: Register jar does not goes to classpath in some cases (daijy)
粗鲁的翻译自:http://hortonworks.com/blog/apache-pig-0-10-1-released/
更多技术文章、感悟、分享、勾搭,请用微信扫描:
相关推荐
Tez是Hadoop生态系统中的一个开源任务执行框架,它由Apache Software Foundation开发,用于提高Hive、Pig等数据处理工具的性能。 【描述】"tez-0.10.1-SNAPSHOT-minimal.tar.gz" 指示这是一个特定版本(0.10.1)的...
Tez是Apache开源的支持DAG作业的计算框架,它直接源于...(1)Apache二级开源项目(源代码今天发布的) (2)运行在YARN之上 (3) 适用于DAG(有向图)应用(同Impala、Dremel和Drill一样,可用于替换Hive/Pig等
5. **集成环境**:除了Hive,还支持Spark、Flink、Pig等大数据处理框架,以及Jupyter Notebook的IPython解释器,提供丰富的数据分析环境。 6. **配置自定义**:用户可以根据需求自定义解释器设置,如连接Hive服务器...
scratch少儿编程逻辑思维游戏源码-足球冠军.zip
实训商业源码-【超人】积分商城 5.2.26-毕业设计.zip
windows,C# WInform,Excel操作类
这个函数可以直接把tdms文件的数据直接储存为MAT文件,并显示在工作区.具体的使用教程可以参考这篇文章。https://blog.csdn.net/weixin_41968197/article/details/106287744
φ3.2m两段式煤气发生炉CAD工艺流程图.rar
实训商业源码-daohang-毕业设计.zip
内容概要:文章深入探讨了在汽车电子电气架构中,网关ECU采用多CPU解决方案实现网关功能的技术背景、设计原则、挑战与解决方案、典型场景及其未来发展趋势。随着汽车电子电气架构向高集成化、智能化发展,网关ECU需处理复杂的多协议数据转发、协议转换、安全过滤等任务,单CPU难以满足需求。多CPU解决方案通过合理的架构设计,确保满足时延、吞吐量、资源管理等核心要求。文中详细介绍了多CPU架构的核心设计原则,包括内部接口设计、资源协同与容错设计,并针对典型场景提出了具体解决方案。此外,文章还讨论了多CPU架构面临的挑战及应对策略,并展望了未来的发展趋势,如异构计算融合、区域网关、确定性以太网等。; 适合人群:从事汽车电子工程领域的工程师和技术人员,尤其是对网关ECU设计和多CPU架构感兴趣的读者。; 使用场景及目标:①理解网关ECU中多CPU解决方案的必要性和优势;②掌握多CPU架构设计的关键技术和最佳实践;③了解多CPU架构在汽车电子电气架构中的应用前景和发展方向。; 其他说明:本文不仅提供了技术层面的深度解析,还强调了长期主义的重要性,鼓励读者在技术追求的同时保持对自身成长的关注。
实训商业源码-ddos压力测试-毕业设计.zip
JavaScript语言教程&案例&相关项目资源,奖励仅限VIP资源
实训商业源码-超级图文最新资源包及上传位置说明-毕业设计.zip
X-Y数控工作台机电系统设计.rar
实训商业源码-活码自动进群V1.0.0 开源版-毕业设计.zip
YZ8H压路机振动轮设计.rar
PE400x600复摆颚式破碎机设计.rar
scratch少儿编程逻辑思维游戏源码-像素森林.zip
scratch少儿编程逻辑思维游戏源码-躁狂症 RPG游戏.zip
scratch少儿编程逻辑思维游戏源码-战争模拟器(1).zip