A classifier is expected to face with many datasets with different characteristics such as being unbalanced. Besides, their missions are different, for example, categorizing data into just two classes or more than two. There are many different parameters for evaluating the performance of a...
I have a bunch of data in the following format (sorry, it does not format well on here):
Key Indicator-----------First Group------------Second Group
1 Grade change...
I'm doing research on business owners, who have started multiple businesses. I had a sample of 2500 businesses and looked at the business owners' success rate from their first to last business. That gave me two groups: 1) business owners who were SUCCESFUL in their first business...
I was wondering if anyone can assist me with this issue.
I am building a logistic regression model to predict purchase or not purchase based on web site behaviour data.
One of the factors that I would like to include in the model is the visits to purchase and the days to purchase...
I am working with a very large dataset and I plan to merge in more years of similar data; however, the data will be panel data so I need to rename the variables with a suffix. I have worked all day on writing script (even started renaming each individual variable but gave up at #409 out...
I am looking at Coursera data anlysis specializations and most (basically all except John Hopkins) offer Python as the language of choice for their courses. I really like R but I started to wonder if I am missing something? I just see no advantages in Python for data analysis, that would...
I have SPSS 22 x64, and SAV file with about 400 cases. Syntax is
FREQUENCIES VARIABLES=var01 var02
/DESTINATION FORMAT=OXML OUTFILE="xmlout.xml".
XML file appears with 0 size, than, after some time, it becomes bigger, but unfinished. For...
So I am trying to call python from R so I can use beautiful soup. I have tried both >system("python myfile.py")
>python.exec("import bs4 some more python script here")
The issue I am running into in both cases is I get back "No module named bs4"
I think the...
Im converting a code written in PYTHON to STATA and I'm stuck at the point when have to apply the iterative procedure. Cannot find the way to write the loop so that it includes the initial value and generates an estimate for lat.
Has any of you got any experience with that...
I am trying to implement the gibbs sampler found here
on page 175. It is written in R but I am trying to write it in Python but running into problems.
Here is my python code
from numpy import *
I have a semester project to develop a tool using Python which implements Kulldorff's algorithm and finds clusters in point (geo)data.
Currently it is a mess: I understand Kuldorff's algorithm upto the part before it dives into Bernoulli & Poisson. I imagine I have to call certain...
I create a web page for conducting surveys (php) and there is also a module on conjoint analysis (orthogonal plan, two attributes at a time approach, trade-off matrix approach, etc) and I have a question: do you have any tips? I heard about a SPSS plugin for Python, has anyone used this...
I have a large file (2 GB) with 4 columns that I want to read and extract some info from. The data looks like this:
CHR START END A
1 10583 10583 0.14
1 10611 10611 0.02
1 13302 13302 0.11
I also have another file from where I have extracted a string to be compared with...