`
weigang.gao
  • 浏览: 467773 次
  • 性别: Icon_minigender_1
  • 来自: 上海
文章分类
社区版块
存档分类
最新评论

hadoop problem

 
阅读更多

1. 启动hadoop2.5.2遇到的datanode启动不了

日志信息如下:


 简单的操作是将这个节点上hadoop工作目录中内存删除就行了

 

 

2.Hadoop开启关闭调试信息

开启:export HADOOP_ROOT_LOGGER=DEBUG,console

关闭:export HADOOP_ROOT_LOGGER=INFO,console

具体参考:http://bufeifang.blog.163.com/blog/static/55898151201232652239324/

 

3.中文分词器

http://blog.sina.com.cn/s/blog_7663527601012vdg.html

 

4.在window下eclipse导出的jar包依赖外部jar包,因此无法使用hadoop命令执行成功。

把hadoop依赖的外部jar文件都拷贝到JAVA_HOME/jre/lib/ext/下面,就可以了

 

5.hadoop常见问题总结

http://www.sharpcloud.cn/thread-4927-1-1.html

 

6.org.apache.hadoop.yarn.exceptions.YarnException: Unauthorized request to start container

问题原因:namenode,datanode时间同步问题

解决办法:多个datanode与namenode进行时间同步,在每台服务器执行:ntpdate time.nist.gov,确认时间同步成功。

最好在每台服务器的 /etc/crontab 中加入一行:

0 2 * * * root ntpdate time.nist.gov && hwclock -w

 

查看centOS时区,

 

7.mapReduce程序中如何添加自己的日志

 

8.在hadoop2.5中怎么找不到HADOOP_HOME/logs/userlogs怎么没有

 

9.官网MapReduce实例代码详细批注

http://www.linuxidc.com/Linux/2014-10/108194.htm

 

10.创建软链

 

 

11.Hive启动报错: Found class jline.Terminal, but interface was expected

http://www.linuxidc.com/Linux/2015-10/123822.htm

 

12.linux ps命令

 有时候系统管理员可能只关心现在系统中运行着哪些程序,而不想知道有哪些进程在运行。由于一个应用程序可能需要启动多个进程。所以在同等情况下,进程的数 量要比程序多的多。为此从阅读方面考虑,管理员需要知道系统中运行的具体程序。要实现这个需求的话,就需要利用命令ps来帮忙。

要对进程进行监测和控制,首先必须要了解当前进程的情况,也就是需要查看当前进程,而 ps 命令就是最基本同时也是非常强大的进程查看命令。使用该命令可以确定有哪些进程正在运行和运行的状态、进程是否结束、进程有没有僵死、哪些进程占用了过多 的资源等等。总之大部分信息都是可以通过执行该命令得到的。

参考:http://www.cnblogs.com/wangkangluo1/archive/2011/09/23/2185938.html 

 

 

 

  • 大小: 43.2 KB
分享到:
评论

相关推荐

    Scaling Big Data with Hadoop and Solr

    Bringing these two technologies together is helping organizations resolve the problem of information extraction from Big Data by providing excellent distributed faceted search capabilities. ...

    Hadoop in Practice

    Hadoop in Practice collects 85 Hadoop examples and presents them in a problem/solution format. Each technique addresses a specific task you'll face,

    hadoop 权威指南(第三版)英文版

    Decomposing a Problem into MapReduce Jobs JobControl Apache Oozie 6. How MapReduce Works . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 187 Anatomy of...

    Big Data, MapReduce, Hadoop, and Spark with Python

    The problem is, the amount of data we collect has outpaced gains in CPU performance. Therefore, scalable methods for processing data are needed. In the early 2000s, Google invented MapReduce, a ...

    hadoop join implement

    Joins in Hadoop has always been a problem for its users: the Map/Reduce framework seems to be specifically designed for group-by aggregation tasks rather than across-table op- erations; on the other ...

    hadoop-eclipse-plugin-2.6.0.jar程序文件

    It takes me two days to find the root cause for the exceptions and finally generate this jar file for hadoop 2.6.0. 1.use hadoop 2.6.0. 2.change the hadoop_home and ...Any problem,please let me know.

    Hadoop in Practice(2012)

    Hadoop in Practice collects 85 Hadoop examples and presents them in a problem/solution format. Each technique addresses a specific task you'll face, like querying big data using Pig or writing a log ...

    Scaling.Big.Data.with.Hadoop.and.Solr.2nd.Edition.1783553391

    Together, Apache Hadoop and Apache Solr help organizations resolve the problem of information extraction from big data by providing excellent distributed faceted search capabilities. This book will ...

    Data-intensive Systems: Principles and Fundamentals using Hadoop and Spark

    The material in the book is structured following a problem-based approach. This means that the content in the chapters is focused on developing solutions to simplified, but still realistic problems ...

    Pro Apache Hadoop(Apress,2ed,2014)

    Learn to solve big-data problems the MapReduce way, by breaking a big problem into chunks and creating small-scale solutions that can be flung across thousands upon thousands of nodes to analyze ...

    hadoop_the_definitive_guide_3nd_edition

    Decomposing a Problem into MapReduce Jobs 180 JobControl 182 Apache Oozie 182 6. How MapReduce Works . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 187...

    hadoop-eclipse-plugin-2.5.2.jar

    Tested with eclipse jee luna R win32 x86 64 Any problem please give comments below Eclipse Java EE IDE for Web Developers Version: Luna Release 4 4 0 Build id: 20140612 0600

    hadoop-eclipse-plugin-2.4.1.jar

    try clean eclipse to see whether it works or not if any problem found.

    PySpark Recipes: A Problem-Solution Approach with PySpark2

    Content is presented in the popular problem-solution format. Look up the programming problem that you want to solve. Read the solution. Apply the solution directly in your own code. Problem solved! ...

    Hadoop3.2.0集群搭建常见注意事项

    hadoop-env.sh中不光需要配置java-home,还需要声明下面这些用户变量,不然无法启动: export JAVA_HOME=/usr/local/jdk/jdk1.8.0_191 export HDFS_NAMENODE_USER=root export HDFS_DATANODE_USER=ro

    nuclei-analysis:一个用于数据分析的 hadoop-gis 项目

    ###核分析一个用于数据分析的 hadoop-gis 项目。 #####Problem 陈述:给定一组 Si 的多边形(核),根据病理图像计算特征向量 Fi。 对功能的选择要有创意。 例子: 遮罩面积、周长、形状蒙版内纹理填充区域纹理开发...

    Spring Recipes A Problem-Solution Approach, 3rd Edition

    Spring Recipes: A Problem-Solution Approach, Third Edition builds upon the best-selling success of the previous editions and focuses on the latest Spring Framework features for building enterprise ...

Global site tag (gtag.js) - Google Analytics