Coevolving Innovations

… in Business Organizations and Information Technologies

Currently Viewing Posts Tagged quantitative methods

Learning data science, hands-on

For the Quantitative Methodologies for Design Research (定量研究方法) course for Ph.D. students at Tongji University in spring 2017, Susu Nousala invited me to join the team of instructors in collaborative education in Shanghai.  Experts were brought in during the course to guide the graduate students.

My participation in the course over two days had three parts:  (a) preparing a lecture outline; (b) orienting the students; and (c) equipping the students with tools.

(A) Preparing a lecture outline

While I’m comfortable with the mathematics underlying statistical analysis, I have a lot of practical experience working with business executives who aren’t.  Thus, my approach to working with data relies heavily on presentation graphics to defog the phenomena.  While the label of data science began to rise circa 2012, my practical experience predates it.

Today's APL
AGSS: A Graphical Statistical System (1994)

In my first professional assignment in IBM Canada in 1985, data science would have been called econometrics.  My work included forecasting country sales, based on price-performance indexes (from the mainframe, midrange and personal computer product divisions) and economic outlooks from Statistics Canada.  Two years before the Macintosh II would bring color to personal computing, I was an early adopter of GRAFSTAT: “An APL system for interactive scientific-engineering graphics and data analysis” developed at IBM Research.  This would eventually become an IBM program product called AGSS (A Graphical Statistical System) by 1994.

Metaphor Computer Systems workstation

In 1988, I had an assignment where data science would have been called marketing science.  I was sent to California to work in the IBM partnership with Metaphor Computer Systems.  This was a Xerox PARC spin-off with a vision that predated the first web page on the World Wide Web by a few years.  These activities led me to the TIMS Marketing Science Conference in 1990, to cofounding the Canadian Centre for Marketing Information Technologies (C2MIT), and to contributing chapters to The Marketing Information Revolution, published in 1994.

This journey led me to appreciate the selection and use of computer-based tools for quantitative analysis.  Today, the two leading platforms in “Data Science 101” are Python (a general-purpose language with statistical libraries), and the R Project for Statistical Computing (a specialized package for data analysis and visualization).  Both are open source projects, and free to download and use on personal computers.  I tried both.  R is a higher-level programming language, more similar to APL, that gets work done more quickly.  For statistical work, I recommend R over Python (although APL is a theoretically better implementation).

Intro to R Programming, Big Data University, Feb. 22, 2017

Since I live in Toronto, I attended the February session of Data Science with R – Bootcamp in person, at Ryerson University.  There, I watched Polong Lin lead a class through R using the Jupyter notebook, both in (i) an interactive version, and (ii) a printable version.  Students had the choice to follow Polong either (i) actively, in a step-by-step execution in the Cognitive Class Virtual Lab (formerly called the Data Scientist Workbench) with a cloud-based R session through their web browsers, or (ii) passively, reading the static printable content.

  • Creative Commons License
    This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License