Soil Classification and Fertilizer Recommendation using WEKA

Similar documents
Soil Test Report. HOME GARDEN VEGETABLE GARDEN Analysis Results

Class 3: Soil Sampling and Testing. Chris Thoreau

Effect on Soil Quality and Plants on Utilization of Domestic and paper mill sludge in Farm Lands

Research on Decision Tree Application in Data of Fire Alarm Receipt and Disposal

SOIL SAMPLING AND ANALYSIS

Table 4. Nutrient uptake and removal by sunflower in Manitoba studies. Nutrient Uptake Removal Uptake Removal

Developing Fertilizer Programs for Fruit Crops Utilizing Soil and Tissue Analysis Soil analysis

ACTIVITY 2 How Does Your Garden Grow?

Fertilizers and nutrient management for hops. Diane Brown, Michigan State University Extension

Understanding Your Virginia Soil Test Report

PASTURE AND HAY FIELDS: SOIL FUNDAMENTALS. Sanders County April 8, Clain Jones

THE USE OF A SMALL HYDROPONIC SYSTEM FOR PRODUCING MARIGOLDS

Nutrient Management And Nutrient Cycling Raymond C. Ward, President Ward Laboratories, Inc Kearney, NE

ACHIEVEMENT LEVEL DESCRIPTORS

Vine Nutrition. A g e n d a 4/10/2017. Soil How to sample Sample submission sheet Lab analysis & results Interpretation

Know Your Soil Getting the Dirt on Your Dirt FWAA. Dr. Steve Petrie Director of Agronomic Services

Data Mining. Brad Morantz PhD.

Nutrient Management of Irrigated Alfalfa and Timothy

Developing and Implementing a Fertilizer Program. Marc van Iersel. mixed with substrate components before planting

Nutrient Management for Tree Fruit. Mary Concklin Visiting Extension Educator Fruit Production and IPM University of Connecticut

Management of Sodic Soils in Alberta

High Carbon Wood Fly Ash as a Biochar Soil Amendment

Soil Fertility & Fertilizers

The Dirt on Soil Science

Ordered Fuzzy ARTMAP: A Fuzzy ARTMAP algorithm with a fixed order

Nutrient Considerations for Olives

The Plastic Plant. Root Architectural Responses to Limited and Altered Nutrient Availability. Bahar Aciksöz May 17 th 2018, Rome

Improving rail network velocity: A machine learning approach to predictive maintenance

Monitoring Nutrition for Crops

Screening of Mulberry Genotypes for Alkali Tolerance- An Integrated Approach

Soils and Fertilizers. Leo Espinoza Soils Specialist

Soil Fertility Note 14 Topsoil

Effect of Storage Duration in the Quality of Vermicompost

FOLIAR FEEDING and SAR for CITRUS TREES. Mongi Zekri and Gary England University of Florida, IFAS

8. Fertility Management

The basic functions of tree root systems are to absorb water

Plant Tissue Testing as a Guide to Side-Dressing Sugar Beets 1

Nutrient Deficiencies

Understanding Soil Variability to Utilize Variable Rate Fertilizer Technology

An Alarm Correlation Algorithm for Network Management Based on Root Cause Analysis

Applied Data Science: Using Machine Learning for Alarm Verification

Managing Phosphorus to Optimize Potato Tuber Yield in the San Luis Valley

Soil test recommendations

Soil: We Can t Grow without it!

Fire Detection Using Image Processing

Researches concerning the impact of some soil maintenance systems upon fruits quality of Generos apple tree variety

Soil. Acidic soils... 1/19/2014

Brian Arnall Oklahoma State Univ. Dept. Plant and Soil Science

DO YOU KNOW YOUR SOILS? (Rev. 10/11)

Making Sense of Soil Tests

PHOSPHORUS AVAILABILITY IN RELATION TO SOIL ph IN CANE BELT AREAS OF FIJI. Nazeea Bano, Milika Vaniqi, Merlyn Goundar

Sunlight. Chlorophyll

Were Soil Test Levels Affected by 2012 Drought?

How to Read a Soil Test Report: Step by Step

FERTILITY MANAGEMENT FOR TOMATOES AND PEPPERS

Alternative Crop Suitability Methodology

Slide 1: Welcome to today s From the Dirt Up presentation, where I ll be talking to you about the importance of soil testing and maintaining healthy

What s in Your Media? Analysis of media components for micronutrient content

Fuchsia: Lower Leaf Purpling

Keywords: Soil Properties, Crop Productivity, Non Fertile, Management, Contaminants, Land use, Nutrition.

The Mode of Urban Renewal Base on the Smart City Theory under the Background of New Urbanization

Anorganic Fertilizer. Lenny Sri Nopriani, SP.MP

Controlled Release Container Nursery Fertilizer Evaluations. Dr. James T. Midcap

DIRT! APES Laboratory Activity

THE INVESTIGATION BEGINS!

Humisphere SOIL RECONNAISSANCE MISSION TO 3 ESTATES IN TANZANIA. Ir. J.H.M. (Hans) Scholten

LEAF & SOIL SAMPLING AND ANALYSIS TO ADJUST CITRUS FERTILIZER PROGRAMS. Mongi Zekri

ISSN: [Vuvaraju* et al., 6(5): May, 2017] Impact Factor: 4.116

Lysimachia: Lower Leaf Purplish-Black Spotting

THE EFFECTS OF HUMATE AND ORGANIC FERTILIZER ON ESTABLISHMENT AND NUTRITION OF CREEPING BENT PUTTING GREENS

REVIEW OF AVOCADO FERTILIZER PRACTICES IN SAN DIEGO COUNTY

Alfalfa Management For Saline Soils. Dr. Don Miller Dir. of Product Development/Plant Breeder

Spring Citrus Meeting Thursday, April 17, :30 to 11:15 A.M.

Irrigation and Fertilization. Mary M. Peet North Carolina State University

Experimental Study on Utilization of E -Waste in Cement Concrete

Appropriate Analyses for New Mexico Soils and Interpreting Soil Test Results

EFFECT OF PHOSPHATIC FERTILIZER RATES LEVEL ON GROWTH, CRUDE PROTEIN CONTENT AND NITROGEN FIXATION ABILITY OF ALFALFA AT SOWING YEAR

POUR THRU TESTING OF CONTAINER MEDIA

The Effects of Soil Acidity on Phosphorus Uptake. by Vegetable Crops in Hestern Oregon- W. A. Sheets,-

#3: Fertilize Appropriately

Assessing and Amending Your Garden Soil Craig Cogger, Soil Scientist Emeritus Washington State University Puyallup

Relationships between soil characters and nutrients uptake of three sugar beet varieties grown in newly reclaimed soil

Greenhouse Plant Nutrition

Soil ph. Evaluation copy

Getting the Most out of Your Strawberry Soil Test Report. General Information

Lesco Fertilizer Evaluation

Technology Transfer of Greenhouse Aeroponic Lettuce Production Information to Alberta Growers,

Unearthing problem soils in the Lachlan Valley

SimpleWater, Inc. Soil, Water, Air Laboratory Sciences 1860 Leroy Ave, Berkeley, CA 94720

EFFECT OF PLANT GROWTH REGULATORS AND STAGES OF SPRAY ON SEED QUALITY OF RIDGE GOURD (Luffa acutangula L. ROXB)

SOIL RESOURCES STUDY NOTES INTRODUCTION: DEFINITION, NATURE AND FUNCTIONS OF SOIL

Supporting Information

Effects of Phosphorus and Calcium on Tuber Set, Yield, and Quality in Goldrush Potato

Farmers need to develop an understanding

Managing Avocado Fertilization and Irrigation Practices for Improved Yields and Fruit Quality

SECTION SOIL PREPARATION

Performance Neuro-Fuzzy for Power System Fault Location

EE04 804(B) Soft Computing Ver. 1.2 Class 1. Introduction February 21st,2012. Sasidharan Sreedharan

BNWAT28: Water consumption in new and existing homes

Workgroup. UCD Alfalfa. Utilizing Plant Tissue Testing & Application Methods to Maximize Fertilizer Efficiency

Transcription:

Soil Classification and Fertilizer Recommendation using WEKA Suman 1, Bharat Bhushan Naib 2 1 Research Scholar, CSE, P.D.M. College of Engineering, Bahadurgarh, Haryana (India) sumansehrawat121@gmail.com 2 Assistant Professor, CSE, P.D.M. College of Engineering, Bahadurgarh, Haryana (India) bharatnaib@gmail.com Abstract Agriculture forms the backbone of any country economy, since a large population lives in rural areas and they directly or indirectly dependent on agriculture for living. The use of standard statistical analysis techniques is both time consuming and expensive. Agricultural research has been profited by technical advances such as automation, data mining. Data mining in agriculture is a novel research field of computer science. Efficient techniques can be developed for solving complex data sets using data mining to improve the accuracy and effectiveness of classification of large soil data sets. The overall goal of the data mining process is to extract information from a data set and transform it into an understandable structure for further use. A soil test is the analysis of a soil sample to determine nutrient content, composition and other characteristics. Tests are usually performed to measure fertility and indicate deficiencies that need to be remedied. In this research, soil dataset containing soil test results has been used to apply various classification techniques in data mining. Soil classification deals with the categorization of soil into different soil classes as very low, low, medium, high, and very high on the basis of % of nutrient found in the soil and on the basis of these classes fertilizer is recommended for a soil sample. Keywords: Soil Dataset, Classification, Recommendation, Agriculture. 1. Introduction Agriculture was the key development in the rise of sedentary human civilization, whereby farming of domesticated species created food surpluses that nurtured the development of civilization. The study of agriculture is known as agricultural science. Soil may be defined as a thin layer of earth s crust which serves as a natural medium for the growth of plants. A study of soil profile supplemented by physical, chemical and biological properties of the soil will give full picture of soil fertility and productivity. Soil 142 can be classified on the basis of taxonomy, ph, liming, nutrients, and organic matters. Data mining plays a crucial role in the Agricultural research [2]. It focuses on classification and clustering of soil using various techniques like artificial neural networks, decision trees, genetic algorithms, nearest neighbor s method etc. WEKA is helpful in learning the basic concepts of data mining where we can apply different options and analyze the output that is being produced[5][16]. This paper is organized as follows. Section 2 includes an analysis of soil fertility using data mining techniques. Section 3 includes the results using WEKA. Section 4 includes the recommendation of the appropriate fertilizer based on fertility level. Section 5 include conclusion. 2. An Analysis of Soil Fertility Using Data Mining Techniques This section include the collection of soil dataset which can be helpful for the classification of soil into different fertility classes using data mining techniques on the basis of percentage of nutrients found in the soil. 2.1 Soil Dataset An intimate knowledge of the soil is pre-requisite for agricultural planning and development programs. The soil dataset is collected as a part of survey and this data is obtained from field sampling. The study of physical, chemical and biological properties of the soil sample will give the knowledge about soil fertility, productivity and deficiencies that need to be remedied. For the WEKA tool the data sets need to

be in the ARFF format. The data sets used for the tests come from the soil testing laboratory bahadurgarh, Jhajjar with the help of Dr. Jain Singh Maan. Based on the field and laboratory data collected for each soil sample contain 10 attribute which are needed for plant growth are shown in table. Field N P K S Fe Cu Zn B ph EC Table 1 : Attribute Description 2.2 Soil Classification Description Nitrogen, ppm Phosphorous, ppm Potassium, ppm Sulfur, ppm Iron, ppm Copper, ppm Zinc, ppm Boron, ppm ph value of soil Electrical Conductivity, mmhos/cm Soil classification deals with the categorization of soil into different soil fertility classes as very low, low, medium, high, and very high on the basis of % of nutrient found in the soil. Data mining algorithms Naïve Bayes, K- Nearest neighbour,c4.5 and cla used for classification of soil are as follows: 2.2.1 Naïve Bayes Bayesian Classifiers are statistical classifiers based on bayes theorem. Bayesian classification shows high accuracy and speed when applied to large data bases. In this classifier it is assume that the effect of an attribute value on a given class is independent of the values of the other attributes. This assumption is called class conditional independence. This classification can predict the probability that a given tuple belongs to a particular class [6][7][8]. 2.2.2 K- Nearest Neighbour (KNN) K-Nearest neighbor algorithm (KNN) is one of the supervised learning algorithms. It is based on a distance function for pairs of observations, such as the Euclidean distance or Cosine. Nearest Neighbor search also known as proximity search, similarity search or closest point search is an optimization problem for finding closest points in metric spaces. In this paradigm, k nearest neighbors of a training data is computed first. Then the similarities of one sample from testing data to the k nearest neighbors are aggregatedaccording to the class of the neighbors and the testing sample is assigned to the most similar class [10][11]. 2.2.3 J48 (C4.5) J48 is an open source Java implementation of the C4.5 algorithm in the Weka data mining tool. C4.5 is a program that creates a decision tree based on a set of labeled input data. This algorithm was developed by Ross Quinlan. The decision trees generated by C4.5 can be used for classification, and for this reason, C4.5 is often referred to as a statistical classifier ( C4.5 (J48), Wikipedia).At each node of the tree, C4.5 chooses one attribute of the data that most effectively splits its set of samples into subsets enriched in one class or the other. Its criterion is the normalized information gain that results from choosing an attribute for splitting the data. The attribute with the highest normalized information gain is chosen to make the decision [9][1]. 2.2.3.1 J48 Classification 1. A flow-chart-like tree structure Internal node denotes a test on an attribute. Branch represents an outcome of the test. Leaf nodes represent class labels or class distribution. 2. Decision tree generation consists of two Phases: Tree construction: At start, all the training examples are at the root and partition examples recursively based on selected attributes. Tree pruning : Identify and remove branches that reflect noise or outliers 3. Use of decision tree: Classifying an unknown sample Test the attribute values of the sample against the decision tree [12]. 143

Classifier Table 2: Comparison of Accuracy Average Accuracy Rate Naïve Bayes 77% K-Nearest Neighbour 70% J48 85% 2.2.4 Proposed Algorithm Proposed algorithm performs the clustering as well as the classification. Firstly the data is clustered by using the K-means Clustering then the linear regression is applied to classify the clusters and the data elements within the clusters. This process can be explained by the following algorithm: 1. Input Data set. 2. Create Number of Clusters, initially 2 clusters are created. 3. Determine the centroid of the Clusters. 4. Determine Distance of objects to centroid using Figure 1: Soil Dataset The dataset is opened using the WEKA. 5. Group the Objects based on minimum distance. 6. If any object move group then go to step 3. 7. Centroid of Clusters act as input for linear classification, linear classification is done by where x is the input and y is the output. a0,a1 are the constants. 8. After classifying all the clusters, classify the elements within the clusters using the linear classification. Figure 2: Open dataset Using WEKA 3.2 Soil Classification Using Various Classifiers Now apply the classification using the classify tab and classify the soil dataset using J48 classifier. 3. Results 3.1 Soil Dataset The soil dataset has 10 attributes and a total 49 instances of soil samples and the dataset prepared in Excel is saved into.csv file to allow them to be applied to WEKA. Figure 3: J48 Tree Soil is classified by using J48 classifier into 5 fertility classes very low, low, medium, high, very high.j48 classifier out of 49 instances classify 44 144

instances correctly and 5 instances incorrectly in 0.06 sec. Table 3: Comparison of Correctly and Incorrectly Classified Instances Algorithm Correctly Classified Incorrectly Classified Time(in seconds) J48 85.4167 14.5833 0.06 Cla 95.9184 4.0816 0.03 Figure 4: Classification by J48 The clustering can also be applied by using the cluster tab. Figure 5: Simple K-mean Clustering using WEKA After clustering of attributes soil is classified into 5 fertility classes very low, low, medium, high, very high. This algorithm out of 49 instances classifies 47 instances correctly and 2 instances incorrectly in 0.03 sec. Figure 6: Classification of Clusters of Attributes The classification applied using the proposed algorithm and J48 algorithm. The table compares the results. The table denotes that the proposed algorithm increases the accuracy as the correctly classified instance are increased and time for classification remains stable. 4. Fertilizer Recommendation This section include that after classification of soil into different fertility classes expert determine what are the deficiencies in the soil, which type of fertilizer should be used for that type of soil and which type of crop is best in that particular type of soil. This facilitates the optimum growth and obtaining the yield potential of the crop. Now fertilizer can be recommended according to fertility level. Soil can be classified on the basis of ph values. When ph value of soil is less than 4.5 then the soil level is very strongly acidic. Scientists recommend that this type of soil is too acidic for most of the crops. To improve its ph it is required to add liming material that has a minimum concentration of 9% Mg. When ph value is in between 4.5 to 5.2 then it is denoted as strongly acidic which also not good for many crops thus to improve its ph liming material having minimum concentration of 9% Mg is recommended. ph value between 5.3 to 6.0 is moderately acidic to improve the ph value of such soil liming material with 3.6% to 9% is necessary. When soil is considered as slightly acidic when ph value is in between 6.1 to 6.9 which is satisfactory for almost crops. But as the ph value increases to the range of 7.0 to 7.5 then soil becomes slightly alkaline which is considered as optimum for most of the crops. As the value of ph increases the alkalinity of soil also increases. When ph value is in between the range of 7.6 to 8.2 the level of soil is moderately alkaline. To drop this type of soils ph gypsum or sulfur are added as it is too alkaline for many crops. Gypsum or sulfur is again recommended when the soil is strongly alkaline with the ph value of 8.6 to 9.0. When ph value reaches up to the value of 9.0 or high then soil becomes highly 145

alkaline which is not suitable to many crops to grow. So we recommend adding liming material having minimum concentration of 9% Mg to the ph ranging up to 6.0. While ph value is less than 7.0 but above 6.0 then recommended solution is to add liming material having concentration of 3.6% to 9.0% of Mg. and for the value of ph up to the range of 7.1 to 9.0, gypsum or sulfur are recommended to add in the soil for making it suitable for the crops. Figure 7: Fertilizer Recommendation 5. Conclusion The research undertaken showed that data mining has advantages and can be easily applied to the soil data set to establish patterns in the data. The application of the WEKA data mining platform provided an easy and quick method for the cluster analysis. The platform provides a number of clustering algorithms that can be used for different tasks. This research aims at analysis of soil dataset using data mining techniques. It focuses on classification of soil using various algorithms into different fertility classes and compare the correctly and incorrectly classified instances. On the basis of fertility classes expert can determine what are the deficiencies in the soil and which fertilizer should be used to overcome that deficiency. Analysis Using Classification Techniques and Soil Attribute Prediction 2012. [2] Prof. Chandrakanth. Biradar, Chatura S Nigudgi An Statistical Based Agriculture Data Analysis International Journal of Emerging Technology and Advanced Engineering, ISSN: 2250-2459, Volume 2, Issue 9, September 2012. [3] Latika Sharma, Nitu Mehta, Data Mining Techniques: A Tool For Knowledge Management System In Agriculture, International Journal Of Scientific & Technology Research Volume 1, Issue 5, ISSN: 2277-8616, June 2012. [4] Joyce Jackson, Data mining : A conceptual overview, Communications of the Association for Information Systems Volume 8, 267-296,2002. [5] WEKA, Wikipedia, March 2013. [6] Professor Tom Fomby, Naïve Bayes Classifier, April 2008. [7] Professor Tom Fomby, Naïve Bayes Classifier, April 2008. [8] Naïve Bayes, Wikipedia, March 2013. [9] C4.5 (J48), Wikipedia, March 2013. [10] Saravanan Thirumuruganathan, A detailed introduction to K-Nearest Neighbor algorithm, may 2010. [11] K-Nearest Neighbor, Wikipedia, March 2013. [12] Kaushik H. Raviya, Biren Gajjar, Performance Evaluation of Different Data Mining Classification Algorithm Using WEKA, Paripex - Indian Journal Of Research, ISSN - 2250-1991, Volume : 2, Issue : 1, January 2013. [13] M. S. Chen, J. Han, P. S. Yu. "Data mining: An overview from a database perspective",ieee Trans. Knowledge and Data Engineering, 8:866-883, 1996 [14] http://en.wikipedia.org/wiki/agriculture [15] http://www.alc.gov.bc.ca/alr/what_is_ag_land.ht m [16] http://www.cropsreview.com/what-isagriculture.html [17] Ramesh Vamanan, K.Ramar Classification Of Agricultural Land Soils A Data Mining Approach, International Journal on Computer Science and Engineering (IJCSE) Issn 0975-3397 Vol. 3 No. 1 Jan 2011. References [1] Jay Gholap, Anurag Ingole, Jayesh Gohil, Shailesh Gargade, Vahida Attar Soil Data 146