统计类优化算法初步

sealbird

浏览: 572195 次
性别:
来自: 广州

最近访客更多访客>>

ladies_killer

wbsh583

u012363178

dilimic120

博主相关

博客

微博

相册

留言

关于我

文章分类

社区版块

存档分类

博客分类：

Lucene

算法 J#多线程

public class testcache {

	class A{
	 public	int []tagid;
	 public	int []tagvalueid;
	}
	public A [] tmpA;
	public  void test(){
//		  tmpA=new A[20000000];
		tmpA=new A[20000000];
		for (int i = 0; i < tmpA.length; i++) {
			tmpA[i]=new A();
			tmpA[i].tagid=new int[10];
			for (int j = 0; j < tmpA[i].tagid.length; j++) {
				tmpA[i].tagid[j]=j;
			}
			tmpA[i].tagvalueid=new int [10];
			
			for (int j = 0; j < tmpA[i].tagvalueid.length; j++) {
				tmpA[i].tagvalueid[j]=j;
			}
		}
	}
	/**
	 * @param args
	 */
	public static void main(String[] args) {
		// TODO Auto-generated method stub
		  System.out.println("freeMemory="+Runtime.getRuntime().freeMemory());   
		  System.out.println("totalMemory="+Runtime.getRuntime().totalMemory());   
		  System.out.println("maxMemory="+Runtime.getRuntime().maxMemory());
		  System.out.println("--------------------------------");
		long start=System.currentTimeMillis();
//		int [] tmp=new int[100000000];
//		for (int i = 0; i < tmp.length; i++) {
//			tmp[i]=i;
//		}
//		String [] tmp=new String[10000000];
//		for (int i = 0; i < tmp.length; i++) {
//			tmp[i]=""+i;
//		}
		
		BirdBitSet tmpBitSet1=new BirdBitSet();
		tmpBitSet1.set(1);
		tmpBitSet1.set(1000);
		tmpBitSet1.set(99999);
		System.out.println("tmpBitSet1.size()="+tmpBitSet1.size());
		
		testcache _testcache=new testcache();
		_testcache.test();
		System.out.println("time1:"+(System.currentTimeMillis()-start));
		BirdBitSet tmpBitSet=new BirdBitSet();
		
		start=System.currentTimeMillis();
		int tmpi=0;
		for (int i = 0; i < 10000000; i++) {
			tmpi=(int) Math.round(Math.random()*20000001);
 
			tmpBitSet.set(tmpi);
		}
		System.out.println("time2:"+(System.currentTimeMillis()-start));
		start=System.currentTimeMillis();
		int iranddom=0;
		
		for (int i = tmpBitSet.nextSetBit(0); i >=0; i=tmpBitSet.nextSetBit(i+1)) {
 
			int tmpicount=_testcache.tmpA[i].tagid.length;
			for (int j = 0; j < tmpicount; j++) {
				int tmp=_testcache.tmpA[i].tagid[j]+100;
			}
			int tmpicount1=_testcache.tmpA[i].tagvalueid.length;
			for (int j = 0; j < tmpicount; j++) {
				int tmp=_testcache.tmpA[i].tagvalueid[j]+100;
			}
		}
 
		System.out.println("iranddom:"+(iranddom));
		System.out.println("time3:"+(System.currentTimeMillis()-start));
		
		
		  System.out.println("freeMemory="+Runtime.getRuntime().freeMemory());   
		  System.out.println("totalMemory="+Runtime.getRuntime().totalMemory());   
		  System.out.println("maxMemory="+Runtime.getRuntime().maxMemory());
		  System.out.println("totalMemory-freeMemory[已经使用的内存]="+(Runtime.getRuntime().totalMemory()-Runtime.getRuntime().freeMemory())/1024/1024);

	}

}

结论

单线程 8核机器

2000w
随机取1000w
freeMemory=8588320
totalMemory=9109504
maxMemory=4727504896
--------------------------------
初始数据
time1:88881
模似数据
time2:1172
iranddom:0
//取值
time3:1324
freeMemory=670798848
totalMemory=4201250816
maxMemory=4727504896
totalMemory-freeMemory[已经使用的内存]=3366 (3.3G)

如果用多线程将取到很好的效果,16个核的机器估计可以支撑1亿记录数据量的聚类

分享到：

C语言插件机制(下) 转 | VS/NAT部署过程 VS/DR部署过程

2010-09-01 17:54
浏览 957
评论(0)
分类:编程语言
查看更多

发表评论

您还没有登录,请您登录后再发表评论

最近访客更多访客>>

博主相关

文章分类

社区版块

存档分类

最新评论

统计类优化算法初步

评论

发表评论

相关推荐

最近访客 更多访客>>

博主相关

文章分类

社区版块

存档分类

最新评论

统计类优化算法初步

评论

发表评论

相关推荐

关于搜索聊天记录

亿级数据的高并发通用搜索引擎架构设计[

lucene2.32 and lucene3.02 搜索对比

Lucene3.0索引格式相关网址

一个简单索引的配置文件

百度分词算法探秘 获取优质长尾流量

取重网记

Lucene2.32升级到3.0 前期记录点

【Lucene3.0 初窥】索引文件格式

lucene搜索结果排序之Payload

自定义排序<1>

最近访客更多访客>>

百度分词算法探秘获取优质长尾流量