Programming in Scala
Martin Odersky, Lex Spoon, et al.
4.7 on Amazon
42 HN comments
The Art of Doing Science and Engineering: Learning to Learn
Richard W. Hamming and Bret Victor
4.7 on Amazon
40 HN comments
The Master Algorithm: How the Quest for the Ultimate Learning Machine Will Remake Our World
Pedro Domingos
4.4 on Amazon
40 HN comments
Operating Systems: Three Easy Pieces
Remzi H Arpaci-Dusseau and Andrea C Arpaci-Dusseau
4.7 on Amazon
40 HN comments
Start with Why: How Great Leaders Inspire Everyone to Take Action
Simon Sinek
4.6 on Amazon
36 HN comments
Java Concurrency in Practice
Brian Goetz , Tim Peierls, et al.
4.7 on Amazon
34 HN comments
Countdown to Zero Day: Stuxnet and the Launch of the World's First Digital Weapon
Kim Zetter, Joe Ochman, et al.
4.7 on Amazon
34 HN comments
Managing Humans: Biting and Humorous Tales of a Software Engineering Manager
Michael Lopp
4.4 on Amazon
33 HN comments
The Innovators: How a Group of Hackers, Geniuses, and Geeks Created the Digital Revolution
Walter Isaacson, Dennis Boutsikaris, et al.
4.6 on Amazon
31 HN comments
Elements of Programming Interviews: The Insiders' Guide
Adnan Aziz , Tsung-Hsien Lee , et al.
4.6 on Amazon
31 HN comments
Accelerated C++: Practical Programming by Example
Andrew Koenig , Mike Hendrickson, et al.
4.2 on Amazon
31 HN comments
The Ascent of Money: A Financial History of the World: 10th Anniversary Edition
Niall Ferguson
4.5 on Amazon
30 HN comments
Programming Rust: Fast, Safe Systems Development
Jim Blandy, Jason Orendorff, et al.
? on Amazon
28 HN comments
Python for Data Analysis: Data Wrangling with Pandas, NumPy, and IPython
Wes McKinney
4.6 on Amazon
28 HN comments
Think Python: How to Think Like a Computer Scientist
Allen B. Downey
4.6 on Amazon
27 HN comments
piqufohonMar 25, 2013
Tarq0nonMar 3, 2019
[1] https://github.com/ageron/handson-ml
wcbeard10onMay 6, 2014
http://shop.oreilly.com/product/0636920023784.do
benawabe896onNov 7, 2012
ploikaonNov 22, 2016
jmdukeonMar 8, 2014
http://books.google.com/books?id=JtJAkfzds4wC&lpg=PA46&ots=p...
Seeing as the 'author' of this post was the submitter, I'd love to hear an explanation of what happened here.
dwrench07onAug 30, 2017
It is a great compliment to Wear McKinney's "Python for Data Analysis" it is more like a recipe book than the internals as Wes' book is. Also, JVP includes more than just Pandas and NumPy goodies.
Highly Recommend, and fork to create your own curated handbook.
cgopalanonFeb 5, 2013
http://shop.oreilly.com/product/0636920023784.do
I have the book and its been great reading so far. Ch 4 gives a nice introduction to Numpy (about 30 pages). Concise but also useful for immediate real-world usage.
throwaway13qf85onMay 1, 2014
I would probably call the selection of topics "finance basics that no one in industry uses any more".
It may be suitable as a beginner's guide if the quality of the exposition is good enough, but my suspicion is that you would be better off getting a copy of "Python for Data Analysis" and a decent quant finance textbook.
dbeckeronJune 29, 2012
Anyone reading this who wants to get started with Pandas: The early release of "Python for Data Analysis" (http://shop.oreilly.com/product/0636920023784.do) is already very helpful.
LolWolfonNov 19, 2016
If you're more like me on using numerical methods, then I'd recommend first going through the math (I can be more specific, depending on what kind of work you'd like to do/are interested in) using Julia/MATLAB/Octave and then jump into NumPy stuff.
In my experience, I've found that using NumPy/SciPy numerical libraries effectively requires some knowledge of the underlying numerical algorithms since the default arguments sometimes require tweaking.
mud_dauberonDec 3, 2013
Structure & Interpretation of Computer Programs (Abelman, Sussman - Lisp oriented, classic)
Algorithms in a Nutshell (OReilly)
Clever Algorithms (Jason Brownlee. Uses Ruby for examples)
Data Mining Analysis - Fundamental Algorithms (Mo. Zaki - ebook)
Python for Data Analysis (Wes McKinney)
Mining of Massive Datasets (Rajaraman, Leskovec, Ullman - ebook)
Enjoy.
pjungwironJuly 17, 2015
These are three very good O'Reilly books that all use Python:
- Programming Collective Intelligence: A broad and shallow survey of automated machine learning techniques.
- Data Analysis with Open Source Tools: Also a survey. More focused on manual data exploration.
- Python for Data Analysis: A pandas tutorial (and more). Very helpful to learn the ML tools in the python ecosystem.
Fitting all that into two months sounds challenging.
a3nonMay 30, 2016
-What important truth do very few people agree with you on?
That the first duty of a society is to ensure that everyone is housed, healthy, fed and educated.
-What is a great company that no one has started yet?
I'm not much of a consumer. The only things that come to mind are more like non-profits and foundations dedicated to things like cheap and widely available pharmaceuticals, and other social improvement.
-What are some of your favorite websites?
HN, NYT, fuckinghomepage.com, python.org, stackoverflow, google news, google images, google translate.
-What are some of your favorite books?
Effective Computation in Physics, Python for Data Analysis, The Ringworld Series, Gone With the Wind, Adventures of Frog and Toad, Unix Power Tools, Catch 22, Working by Studs Terkel, A Canticle for Liebowitz, Travels With Charlie, Cannery Row, East of Eden, The Beautiful and the Damned, Time Enough for Love.
-What would you do if you knew that you only had 24 hours left on Earth?
Go outside.
-What do you really believe in?
I marvel at the unfathomable size and age of the universe, and the effectively zero chance that I could ever exist. I believe that when I look out at the universe, in at myself, and around at my world and fellow beings, that it's an instance of the universe observing and knowing itself. I also believe that we humans are wasting our world and our civilization, that this could be a true paradise of love, respect and potential, and I'm skeptically optimistic that that could come to pass.
pav3lonNov 16, 2012
Edit: Sorry, not a referral link.
11001onJan 25, 2013
birkenonOct 4, 2012
One caveat I would mention about data analysis would be that statistics is not just number crunching. It is really a bit of an art to making sure you are looking at the right sample of data in the right way, and ensure you are accounting for all potential biases. Surprisingly, I have noticed as I've gotten more experience doing data analysis, it takes me longer to do and I make less confident assertions. But on the other hand, I now very rarely make assertions which were incorrect, which is extremely important. I believe that incorrect data analysis is significantly worse than no data analysis.
So, the advice I would give to people getting started is whenever you come to a conclusion by analyzing a particular piece of data, ask your "if I look at the data differently, can I come to the opposite conclusion?". You would be surprised how often the answer to this question is yes, and that is a good indicator that you a) need more data or b) cannot make a significant conclusion. This can be especially difficult when you are already sure you know the answer to a question even before you do the data analysis, but you really have to be disciplined about it.
mtmailonJune 6, 2020
While the https://tailwindcss.com/ CSS library is open source, the authors make money by selling access to a component library https://tailwindui.com/
jackbrianonMar 14, 2017
I learned the hard way that it is quite difficult to break into finance as a non-student, so do everything you can now to land that first gig. Good luck!
Some starting resources:
-Ernie Chan's books and blog (https://epchan.blogspot.com/)
-QuantStart has great starter material and a new book, although I haven't read it (https://www.quantstart.com/)
-"Inside the Black Box" (Narang) I've seen referenced a good bit but felt as though it leaned toward order execution and rather boring
-"Dark Pools" (Scott Patterson) a great story about the rise of algorithmic trading
-"Flash Boys' (Michael Lewis) offers a nice follow up (HFT), but considered a bit sensationalist
EDIT: If you're planning on using Python (a solid bet)...
-Python for Data Analysis (Wes McKinney) - Great, quick book for Pandas by former AQR (and now Two Sigma?) guy.
-Yves Hilpisch books: "Python for Finance" is introductory while "Derivative Analytics in Python" is quite math heavy.
tomkuonDec 17, 2012
Looks like you're on the right track though, "Think Stats" and Udacity's stats class were the main things I was going to recommend. I'd also recommend checking out IPython's web notebook for inline charts and general awesomeness, and the Pandas library for an R-style data frame built on top of NumPy. The best resources for learning about IPython are probably screencasts, and the author of Pandas has a book out named "Python for Data Analysis" that covers IPython, NumPy, Pandas and some matplotlib.
rmbibeaultonOct 4, 2018
Remote: Yes, or onsite
Willing to relocate: yes
Technologies: Python, C, C++, rust, Linux (Ubuntu, Raspbian), Scheme (LISP), git, macOS
Resume: https://drive.google.com/file/d/1XVKev1z8WF3IpKJCzV4i3XSlBpE...
Email: on resume, RichardMBibeault@gmail.com
================
Recent physics graduate, math minor (bachelors of science), seeking employment in data science or software engineering. Highly ambitious, enjoys self directed learning and difficult problems.
I quite enjoy the learning process, and especially working on complex problems, breaking them down into their component pieces
, and figuring out a best approach for each piece.
Currently reading Python for Data Analysis, and Effective Modern C++.
Recently read a good chunk of TCP/IP administration 3rd edition, Land of Lisp, and Data Science from Scratch.
JPKabonFeb 12, 2013
I could go on and on, but the short order is this: Pandas has an object called a DataFrame which maps 1 to 1 with a sheet from an Excel file. You can pull in two sheets of data from Excel, do merges, joins, iterations, you name it. I am a professional who also "has a job to do", and after two 1 hour sessions in the evening with Pandas, I immediately began using it at work to manage the huge quantity of spreadsheets my team is generating, combined with output from various databases. Anyone who knows SQL will love Pandas, because it allows you to very simply and easily conduct SQL style set logic on data without having any of it in a database.
TedPetrouonNov 27, 2017
I wrote a very detailed review of the book, Python for Data Analysis (2nd edition) by Wes McKinney.
Here is a high-level summary:
PDA is written very much like a reference manual, methodically covering one feature or operation before moving on to the next. The current version of the official documentation is a much more thorough reference guide if you are looking to learn pandas in a similar type of manner.
There is very little actual data analysis and almost no teaching of common techniques or theory that are crucial to making sense of data.
The vast majority of examples use randomly generated or contrived data that bear little resemblance to what data actually look like in the real world.
For the most part, the operations are learned in isolation, independent from other parts of the pandas library. This is not how data analysis happens in the real-world, where many commands from different sections of the library will be combined together to get a desired result.
Although the commands will work for the current pandas version 0.21, it is clear that the book was not updated past version 0.18, which was released in March of 2016. This is apparent because the resample method gained the on parameter in version 0.19 which was absent in PDA. The powerful and popular function merge_asof was also added in version 0.19 and is not mentioned once in the book.
There were numerous instances where it was clear that the book was not updated to show more modern code. For instance, the take method is almost never used any more and has been completely replaced by the .iloc indexer. There were also many instances were code snippets could be significantly transformed by using completely different syntax, which would result in much better performance and readability.
One of the most confusing things for newcomers to pandas are the multiple ways to select data with the indexers[], .loc, and .iloc. There is not enough detailed explanations for the reader to walk away with a thorough understanding of each.
cessoronJune 7, 2014
Other than that I'd recommend that you find a domain you are particularily interested in and get some books on the specific aspects of that, for example Python for Data Analysis. http://shop.oreilly.com/product/0636920023784.do
This allows you to focus on what to do with python and the eco system of frameworks and so on.
If you want to learn how to use Python effectively, I believe you should decide what domain to apply it in. Python is no end in itself, as is the case with most other tools...
rmbibeaultonSep 12, 2018
Remote: Yes, or onsite
Willing to relocate: maybe
Technologies: Python, C, Debian-based Linux, git
Resume: https://drive.google.com/file/d/13f_QTtwTMI0UsdxiCprqoThnq2v...
Email: on resume, RichardMBibeault@gmail.com
================
Recent physics graduate, math minor (bachelors of science), seeking employment in data science or software engineering.
Highly ambitious, enjoys self directed learning and difficult problems.
Currently reading TCP/IP administration 3rd edition, Data Science from Scratch, and Python for Data Analysis.
tjtonMay 3, 2016
Python for Data Analysis - http://shop.oreilly.com/product/0636920023784.do - I'm working on this one now.
Data Wrangling with Python - http://shop.oreilly.com/product/0636920032861.do
Fluent Python - http://shop.oreilly.com/product/0636920032519.do
Learning Predictive Analytics with Python - http://shop.oreilly.com/product/9781783983261.do
Data Visualization with Python and JavaScript - http://shop.oreilly.com/product/0636920037057.do
Thanks for any comments!
dbeckeronSep 26, 2012
I expect that user base will increase even faster after Wes McKinney's "Python for Data Analysis" book is published.
joeclark77onNov 16, 2016
Five years ago I would have said "R", but Python enthusiasts have been replicating what R does in Python at a frantic pace, and R will never really replicate what Python brings to the table.