hadoop-common-project/hadoop-common/src/site/markdown/release/0.23.7/CHANGELOG.0.23.7.md

<!---
# Licensed to the Apache Software Foundation (ASF) under one
# or more contributor license agreements. See the NOTICE file
# distributed with this work for additional information
# regarding copyright ownership. The ASF licenses this file
# to you under the Apache License, Version 2.0 (the
# "License"); you may not use this file except in compliance
# with the License. You may obtain a copy of the License at
#
#   http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing, software
# distributed under the License is distributed on an "AS IS" BASIS,
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License.
-->

Apache Hadoop Changelog

Release 0.23.7 - 2013-04-18

INCOMPATIBLE CHANGES:

| JIRA | Summary | Priority | Component | Reporter | Contributor |
| --- | --- | --- | --- | --- | --- |
| HDFS-395 | DFS Scalability: Incremental block reports | Major | datanode, namenode | dhruba borthakur | Tomasz Nykiel |

NEW FEATURES:

| JIRA | Summary | Priority | Component | Reporter | Contributor |
| --- | --- | --- | --- | --- | --- |
| HADOOP-9209 | Add shell command to dump file checksums | Major | fs, tools | Todd Lipcon | Todd Lipcon |

IMPROVEMENTS:

| JIRA | Summary | Priority | Component | Reporter | Contributor |
| --- | --- | --- | --- | --- | --- |
| HADOOP-7358 | Improve log levels when exceptions caught in RPC handler | Minor | ipc | Todd Lipcon | Todd Lipcon |
| HADOOP-7886 | Add toString to FileStatus | Minor | . | Jakob Homan | SreeHari |
| HADOOP-8214 | make hadoop script recognize a full set of deprecated commands | Major | scripts | Roman Shaposhnik | Roman Shaposhnik |
| HADOOP-8711 | provide an option for IPC server users to avoid printing stack information for certain exceptions | Major | ipc | Brandon Li | Brandon Li |
| HDFS-3817 | avoid printing stack information for SafeModeException | Major | namenode | Brandon Li | Brandon Li |
| HADOOP-8075 | Lower native-hadoop library log from info to debug | Major | native | Eli Collins | Hızır Sefa İrken |
| HADOOP-9147 | Add missing fields to FIleStatus.toString | Trivial | . | Jonathan Allen | Jonathan Allen |
| MAPREDUCE-4907 | TrackerDistributedCacheManager issues too many getFileStatus calls | Major | mrv1, tasktracker | Sandy Ryza | Sandy Ryza |
| HADOOP-9216 | CompressionCodecFactory#getCodecClasses should trim the result of parsing by Configuration. | Major | io | Tsuyoshi Ozawa | Tsuyoshi Ozawa |
| HADOOP-8849 | FileUtil#fullyDelete should grant the target directories +rwx permissions before trying to delete them | Minor | . | Ivan A. Veselovsky | Ivan A. Veselovsky |
| MAPREDUCE-4458 | Warn if java.library.path is used for AM or Task | Major | mrv2 | Robert Joseph Evans | Robert Parker |
| MAPREDUCE-4949 | Enable multiple pi jobs to run in parallel | Minor | examples | Sandy Ryza | Sandy Ryza |
| HADOOP-9247 | parametrize Clover "generateXxx" properties to make them re-definable via -D in mvn calls | Minor | . | Ivan A. Veselovsky | Ivan A. Veselovsky |
| MAPREDUCE-4822 | Unnecessary conversions in History Events | Trivial | jobhistoryserver | Robert Joseph Evans | Chu Tong |
| MAPREDUCE-4989 | JSONify DataTables input data for Attempts page | Major | jobhistoryserver, mr-am | Ravi Prakash | Ravi Prakash |
| YARN-249 | Capacity Scheduler web page should show list of active users per queue like it used to (in 1.x) | Major | capacityscheduler | Ravi Prakash | Ravi Prakash |
| HADOOP-9336 | Allow UGI of current connection to be queried | Critical | ipc | Daryn Sharp | Daryn Sharp |
| HADOOP-9352 | Expose UGI.setLoginUser for tests | Major | security | Daryn Sharp | Daryn Sharp |
| HADOOP-8462 | Native-code implementation of bzip2 codec | Major | io | Govind Kamat | Govind Kamat |
| HADOOP-9374 | Add tokens from -tokenCacheFile into UGI | Major | security | Daryn Sharp | Daryn Sharp |
| YARN-443 | allow OS scheduling priority of NM to be different than the containers it launches | Major | nodemanager | Thomas Graves | Thomas Graves |
| HADOOP-9379 | capture the ulimit info after printing the log to the console | Trivial | . | Arpit Gupta | Arpit Gupta |
| MAPREDUCE-4990 | Construct debug strings conditionally in ShuffleHandler.Shuffle#sendMapOutput() | Trivial | . | Karthik Kambatla | Karthik Kambatla |
| HADOOP-9253 | Capture ulimit info in the logs at service start time | Major | . | Arpit Gupta | Arpit Gupta |
| YARN-525 | make CS node-locality-delay refreshable | Major | capacityscheduler | Thomas Graves | Thomas Graves |
| MAPREDUCE-5079 | Recovery should restore task state from job history info directly | Critical | mr-am | Jason Lowe | Jason Lowe |

BUG FIXES:

| JIRA | Summary | Priority | Component | Reporter | Contributor |
| --- | --- | --- | --- | --- | --- |
| MAPREDUCE-3952 | In MR2, when Total input paths to process == 1, CombinefileInputFormat.getSplits() returns 0 split. | Major | mrv2 | Zhenxiao Luo | Bhallamudi Venkata Siva Kamesh |
| HDFS-1765 | Block Replication should respect under-replication block priority | Major | namenode | Hairong Kuang | Uma Maheswara Rao G |
| MAPREDUCE-4007 | JobClient getJob(JobID) should return NULL if the job does not exist (for backwards compatibility) | Major | mrv2 | Alejandro Abdelnur | Alejandro Abdelnur |
| HADOOP-6941 | Support non-SUN JREs in UserGroupInformation | Major | . | Stephen Watt | Devaraj Das |
| HADOOP-8251 | SecurityUtil.fetchServiceTicket broken after HADOOP-6941 | Blocker | security | Todd Lipcon | Todd Lipcon |
| HDFS-3119 | Overreplicated block is not deleted even after the replication factor is reduced after sync follwed by closing that file | Minor | namenode | J.Andreina | Ashish Singhi |
| HDFS-3256 | HDFS considers blocks under-replicated if topology script is configured with only 1 rack | Major | . | Aaron T. Myers | Aaron T. Myers |
| HADOOP-8346 | Changes to support Kerberos with non Sun JVM (HADOOP-6941) broke SPNEGO | Blocker | security | Alejandro Abdelnur | Devaraj Das |
| MAPREDUCE-4470 | Fix TestCombineFileInputFormat.testForEmptyFile | Major | test | Kihwal Lee | Ilya Katsov |
| YARN-83 | Change package of YarnClient to include apache | Major | client | Bikas Saha | Bikas Saha |
| YARN-40 | Provide support for missing yarn commands | Major | client | Devaraj K | Devaraj K |
| YARN-150 | AppRejectedTransition does not unregister app from master service and scheduler | Major | . | Bikas Saha | Bikas Saha |
| HADOOP-8878 | uppercase namenode hostname causes hadoop dfs calls with webhdfs filesystem and fsck to fail when security is on | Major | . | Arpit Gupta | Arpit Gupta |
| HDFS-4072 | On file deletion remove corresponding blocks pending replication | Minor | namenode | Jing Zhao | Jing Zhao |
| HDFS-2434 | TestNameNodeMetrics.testCorruptBlock fails intermittently | Major | test | Uma Maheswara Rao G | Jing Zhao |
| MAPREDUCE-4637 | Killing an unassigned task attempt causes the job to fail | Major | mrv2 | Tom White | Mayank Bansal |
| MAPREDUCE-4278 | cannot run two local jobs in parallel from the same gateway. | Major | . | Araceli Henley | Sandy Ryza |
| HADOOP-9155 | FsPermission should have different default value, 777 for directory and 666 for file | Minor | . | Binglin Chang | Binglin Chang |
| HADOOP-8816 | HTTP Error 413 full HEAD if using kerberos authentication | Major | net | Moritz Moeller | Moritz Moeller |
| HADOOP-9212 | Potential deadlock in FileSystem.Cache/IPC/UGI | Major | fs | Tom White | Tom White |
| HADOOP-9193 | hadoop script can inadvertently expand wildcard arguments when delegating to hdfs script | Minor | scripts | Jason Lowe | Andy Isaacson |
| MAPREDUCE-4946 | Type conversion of map completion events leads to performance problems with large jobs | Critical | mr-am | Jason Lowe | Jason Lowe |
| HADOOP-9231 | Parametrize staging URL for the uniformity of distributionManagement | Major | build | Konstantin Boudnik | Konstantin Boudnik |
| HDFS-4426 | Secondary namenode shuts down immediately after startup | Blocker | namenode | Jason Lowe | Arpit Agarwal |
| YARN-354 | WebAppProxyServer exits immediately after startup | Blocker | . | Liang Xie | Liang Xie |
| HADOOP-8857 | hadoop.http.authentication.signature.secret.file docs should not state that secret is randomly generated | Minor | security | Eli Collins | Alejandro Abdelnur |
| HDFS-4444 | Add space between total transaction time and number of transactions in FSEditLog#printStatistics | Trivial | . | Stephen Chu | Stephen Chu |
| YARN-133 | update web services docs for RM clusterMetrics | Major | resourcemanager | Thomas Graves | Ravi Prakash |
| HADOOP-9190 | packaging docs is broken | Major | documentation | Thomas Graves | Andy Isaacson |
| HDFS-4288 | NN accepts incremental BR as IBR in safemode | Critical | namenode | Daryn Sharp | Daryn Sharp |
| HADOOP-9221 | Convert remaining xdocs to APT | Major | . | Andy Isaacson | Andy Isaacson |
| YARN-343 | Capacity Scheduler maximum-capacity value -1 is invalid | Major | capacityscheduler | Thomas Graves | Xuan Gong |
| MAPREDUCE-4893 | MR AppMaster can do sub-optimal assignment of containers to map tasks leading to poor node locality | Major | applicationmaster | Bikas Saha | Bikas Saha |
| MAPREDUCE-4969 | TestKeyValueTextInputFormat test fails with Open JDK 7 | Major | test | Arpit Agarwal | Arpit Agarwal |
| YARN-364 | AggregatedLogDeletionService can take too long to delete logs | Major | . | Jason Lowe | Jason Lowe |
| HDFS-4462 | 2NN will fail to checkpoint after an HDFS upgrade from a pre-federation version of HDFS | Major | namenode | Aaron T. Myers | Aaron T. Myers |
| YARN-360 | Allow apps to concurrently register tokens for renewal | Critical | . | Daryn Sharp | Daryn Sharp |
| MAPREDUCE-4953 | HadoopPipes misuses fprintf | Major | pipes | Andy Isaacson | Andy Isaacson |
| HADOOP-9278 | HarFileSystem may leak file handle | Major | fs | Chris Nauroth | Chris Nauroth |
| YARN-357 | App submission should not be synchronized | Major | resourcemanager | Daryn Sharp | Daryn Sharp |
| YARN-355 | RM app submission jams under load | Blocker | resourcemanager | Daryn Sharp | Daryn Sharp |
| HADOOP-9289 | FsShell rm -f fails for non-matching globs | Blocker | fs | Daryn Sharp | Daryn Sharp |
| MAPREDUCE-4671 | AM does not tell the RM about container requests that are no longer needed | Major | . | Bikas Saha | Bikas Saha |
| HADOOP-9124 | SortedMapWritable violates contract of Map interface for equals() and hashCode() | Minor | io | Patrick Hunt | Surenkumar Nihalani |
| YARN-362 | Unexpected extra results when using webUI table search | Minor | . | Jason Lowe | Ravi Prakash |
| HADOOP-9302 | HDFS docs not linked from top level | Major | documentation | Thomas Graves | Andy Isaacson |
| HADOOP-9303 | command manual dfsadmin missing entry for restoreFailedStorage option | Major | . | Thomas Graves | Andy Isaacson |
| MAPREDUCE-5000 | TaskImpl.getCounters() can return the counters for the wrong task attempt when task is speculating | Critical | mr-am | Jason Lowe | Jason Lowe |
| HDFS-4495 | Allow client-side lease renewal to be retried beyond soft-limit | Major | hdfs-client | Kihwal Lee | Kihwal Lee |
| HADOOP-9154 | SortedMapWritable#putAll() doesn't add key/value classes to the map | Major | io | Karthik Kambatla | Karthik Kambatla |
| MAPREDUCE-5009 | Killing the Task Attempt slated for commit does not clear the value from the Task commitAttempt member | Critical | mrv1 | Robert Parker | Robert Parker |
| YARN-400 | RM can return null application resource usage report leading to NPE in client | Critical | resourcemanager | Jason Lowe | Jason Lowe |
| YARN-236 | RM should point tracking URL to RM web page when app fails to start | Major | resourcemanager | Jason Lowe | Jason Lowe |
| HDFS-4222 | NN is unresponsive and loses heartbeats of DNs when Hadoop is configured to use LDAP and LDAP has issues | Minor | namenode | Xiaobo Peng | Xiaobo Peng |
| YARN-426 | Failure to download a public resource on a node prevents further downloads of the resource from that node | Critical | nodemanager | Jason Lowe | Jason Lowe |
| HADOOP-9339 | IPC.Server incorrectly sets UGI auth type | Major | ipc | Daryn Sharp | Daryn Sharp |
| MAPREDUCE-4871 | AM uses mapreduce.jobtracker.split.metainfo.maxsize but mapred-default has mapreduce.job.split.metainfo.maxsize | Major | mrv2 | Jason Lowe | Jason Lowe |
| YARN-269 | Resource Manager not logging the health_check_script result when taking it out | Major | resourcemanager | Thomas Graves | Jason Lowe |
| YARN-376 | Apps that have completed can appear as RUNNING on the NM UI | Blocker | resourcemanager | Jason Lowe | Jason Lowe |
| YARN-410 | New lines in diagnostics for a failed app on the per-application page make it hard to read | Major | . | Vinod Kumar Vavilapalli | Omkar Vinit Joshi |
| MAPREDUCE-4794 | DefaultSpeculator generates error messages on normal shutdown | Major | applicationmaster | Jason Lowe | Jason Lowe |
| HDFS-4128 | 2NN gets stuck in inconsistent state if edit log replay fails in the middle | Major | namenode | Todd Lipcon | Kihwal Lee |
| HDFS-4532 | RPC call queue may fill due to current user lookup | Critical | namenode | Daryn Sharp | Daryn Sharp |
| YARN-448 | Remove unnecessary hflush from log aggregation | Major | nodemanager | Kihwal Lee | Kihwal Lee |
| MAPREDUCE-5043 | Fetch failure processing can cause AM event queue to backup and eventually OOM | Blocker | mr-am | Jason Lowe | Jason Lowe |
| YARN-345 | Many InvalidStateTransitonException errors for ApplicationImpl in Node Manager | Critical | nodemanager | Devaraj K | Robert Parker |
| HDFS-4553 | Webhdfs will NPE on some unexpected response codes | Major | webhdfs | Daryn Sharp | Daryn Sharp |
| YARN-227 | Application expiration difficult to debug for end-users | Major | resourcemanager | Jason Lowe | Jason Lowe |
| MAPREDUCE-5027 | Shuffle does not limit number of outstanding connections | Major | . | Jason Lowe | Robert Parker |
| MAPREDUCE-3685 | There are some bugs in implementation of MergeManager | Critical | mrv2 | anty.rao | anty |
| HDFS-4544 | Error in deleting blocks should not do check disk, for all types of errors | Major | . | Amareshwari Sriramadasu | Arpit Agarwal |
| MAPREDUCE-5023 | History Server Web Services missing Job Counters | Critical | jobhistoryserver, webapps | Kendall Thrapp | Ravi Prakash |
| MAPREDUCE-5060 | Fetch failures that time out only count against the first map task | Critical | . | Robert Joseph Evans | Robert Joseph Evans |
| HDFS-3344 | Unreliable corrupt blocks counting in TestProcessCorruptBlocks | Major | namenode | Tsz Wo Nicholas Sze | Kihwal Lee |
| HADOOP-9406 | hadoop-client leaks dependency on JDK tools jar | Major | build | Alejandro Abdelnur | Alejandro Abdelnur |
| MAPREDUCE-5042 | Reducer unable to fetch for a map task that was recovered | Blocker | mr-am, security | Jason Lowe | Jason Lowe |
| MAPREDUCE-5053 | java.lang.InternalError from decompression codec cause reducer to fail | Major | . | Robert Parker | Robert Parker |
| MAPREDUCE-5075 | DistCp leaks input file handles | Major | distcp | Chris Nauroth | Chris Nauroth |
| YARN-109 | .tmp file is not deleted for localized archives | Major | nodemanager | Jason Lowe | Mayank Bansal |
| HDFS-4581 | DataNode#checkDiskError should not be called on network errors | Major | datanode | Rohit Kochar | Rohit Kochar |
| YARN-460 | CS user left in list of active users for the queue even when application finished | Blocker | capacityscheduler | Thomas Graves | Thomas Graves |
| MAPREDUCE-4992 | AM hangs in RecoveryService when recovering tasks with speculative attempts | Critical | mr-am | Robert Parker | Robert Parker |
| HDFS-4649 | Webhdfs cannot list large directories | Blocker | namenode, security, webhdfs | Daryn Sharp | Daryn Sharp |
| MAPREDUCE-5137 | AM web UI: clicking on Map Task results in 500 error | Major | applicationmaster | Thomas Graves | Thomas Graves |

TESTS:

| JIRA | Summary | Priority | Component | Reporter | Contributor |
| --- | --- | --- | --- | --- | --- |
| HADOOP-8157 | TestRPCCallBenchmark#testBenchmarkWithWritable fails with RTE | Major | . | Eli Collins | Todd Lipcon |
| HADOOP-9067 | provide test for method org.apache.hadoop.fs.LocalFileSystem.reportChecksumFailure(Path, FSDataInputStream, long, FSDataInputStream, long) | Minor | . | Ivan A. Veselovsky | Ivan A. Veselovsky |
| MAPREDUCE-4905 | test org.apache.hadoop.mapred.pipes | Major | . | Aleksey Gorshkov | Aleksey Gorshkov |
| MAPREDUCE-4972 | Coverage fixing for org.apache.hadoop.mapreduce.jobhistory | Major | . | Aleksey Gorshkov | Aleksey Gorshkov |
| MAPREDUCE-4875 | coverage fixing for org.apache.hadoop.mapred | Major | test | Aleksey Gorshkov | Aleksey Gorshkov |
| MAPREDUCE-4991 | coverage for gridmix | Major | . | Aleksey Gorshkov | Aleksey Gorshkov |
| MAPREDUCE-5007 | fix coverage org.apache.hadoop.mapreduce.v2.hs | Major | . | Aleksey Gorshkov | Aleksey Gorshkov |

SUB-TASKS:

| JIRA | Summary | Priority | Component | Reporter | Contributor |
| --- | --- | --- | --- | --- | --- |
| HDFS-2477 | Optimize computing the diff between a block report and the namenode state. | Major | namenode | Tomasz Nykiel | Tomasz Nykiel |
| HDFS-2495 | Increase granularity of write operations in ReplicationMonitor thus reducing contention for write lock | Major | namenode | Tomasz Nykiel | Tomasz Nykiel |
| HDFS-2476 | More CPU efficient data structure for under-replicated/over-replicated/invalidate blocks | Major | namenode | Tomasz Nykiel | Tomasz Nykiel |
| YARN-29 | Add a yarn-client module | Major | client | Vinod Kumar Vavilapalli | Vinod Kumar Vavilapalli |
| HDFS-4542 | Webhdfs doesn't support secure proxy users | Blocker | webhdfs | Daryn Sharp | Daryn Sharp |
| HDFS-4560 | Webhdfs cannot use tokens obtained by another user | Major | webhdfs | Daryn Sharp | Daryn Sharp |
| HDFS-4566 | Webdhfs token cancelation should use authentication | Major | webhdfs | Daryn Sharp | Daryn Sharp |
| HDFS-4567 | Webhdfs does not need a token for token operations | Major | webhdfs | Daryn Sharp | Daryn Sharp |
| HDFS-4577 | Webhdfs operations should declare if authentication is required | Major | webhdfs | Daryn Sharp | Daryn Sharp |
| YARN-468 | coverage fix for org.apache.hadoop.yarn.server.webproxy.amfilter | Major | . | Aleksey Gorshkov | Aleksey Gorshkov |
| YARN-200 | yarn log does not output all needed information, and is in a binary format | Major | . | Robert Joseph Evans | Ravi Prakash |
| HDFS-4548 | Webhdfs doesn't renegotiate SPNEGO token | Blocker | . | Daryn Sharp | Daryn Sharp |