Categories
Training & Community

New Stars Of Data 6 Preparations: August

In this post, I talk about my August preparations for the upcoming New Stars Of Data 6 event in October 2023.

Table of Contents

Introduction

Last month, I shared the news that I’m speaking at the next New Stars Of Data event in October:

2023 08 11 NewStarsOfDataSchedule

I explained how I prepared my submission and outlined what I’d been up to in July. Now it’s time for an August update!

Presentation Slides

In this section, I examine how the presentation is going and what I’ve spent my time on. Broadly, this breaks down into finishing the opening slides and preparing the Data Wrangler demo.

Opening Slides

This was probably the most important August job for the New Stars Of Data preparations. Before I could properly start work on the rest of the presentation, I needed to have the opening in a state where I could practise it. Additionally, Olivier was expecting a demo of it at the end of the month!

The opening slides do a couple of jobs for me:

  • Letting viewers know they’re in the right place.
  • Setting my agenda.
  • Introducing the Sale Sizzlers and the associated data.
  • Introducing myself!

They also outline the insights I’m going to create, which meant I could start work on…

Demo Storyboarding

A big part of the session will be the VS Code Data Wrangler demo. I want to present the extension in the best possible light and make the session worth everyone’s time, so the more polished the demo the better!

I spent some time on GitHub and YouTube researching Data Wrangler’s capabilities. Microsoft Product Manager Eren Orbey talks about Data Wrangler in several YouTube videos, including this Guy In A Cube video that was published right in the middle of August!

Armed with these insights, I considered the wrangler operations that made the most sense with the Sale Sizzler data. I now have a sequence of operations that add value to the data and showcase Data Wrangler well.

What remains now is practice. Lots of practice!

Technology Testing

In this section, I examine the technology I’ll be using to deliver the presentation.

Current Tech

My current laptop is from 2016. Performance-wise, it’s been upgraded with more RAM and an SSD and runs like a dream. However, the webcam and microphone are firmly rooted in the pre-COVID times when such devices weren’t as big a deal as they are now.

Upon testing, the microphone seems to be fine. The webcam? Not so much.

Teams HPTrueVision
Laptop: HP Truevision

Okay, so decent visual quality isn’t a condition of the New Stars Of Data slot. And I could even turn the camera off if necessary. But I’ve been considering a new webcam for a while now anyway, and this seemed like an ideal time to take a look at the webcam market.

I also found that my practising sessions felt better on my work laptop, which has a better webcam. Not having to deal with a potato camera that makes it look like I’m on the surface of the sun is a big confidence boost.

New Camera

Firstly, I browsed the website of UK retailer Currys. Currys has a good webcam buyers guide, and their online store has extensive filters to drill down into brands, costs and features. After some time on the Currys, TomsGuide and Wired sites, I shortlisted these cameras:

Armed with this list, I turned to YouTube for some hands-on reviews. Several creators have made videos that try out and compare these cameras, and the footage was very helpful when making a decision.

Of particular use was this BadIntent video:

And this Andyhhhdx video:

I ultimately chose the Logitech StreamCam because it’s a newer model than the C922, and the visual quality looks better than the Kiyo. Finally, I shopped around for the best price, arriving at a very appealing offer on Amazon:

2023 08 29 LogitechStreamCam

Initial results from the StreamCam are encouraging!

Test LogiDefault
Logitech SteamCam: Default

Logitech also offers free software to support their cameras. The Logitech Capture app features camera customization settings and an OBS-like video editing and streaming environment. I’m going to spend some time with it in September to find the optimum setup.

New Lighting

I also wanted to upgrade the room lighting. This was less of a task, as I bought some Calex smart bulbs earlier in the year and had one spare! After ten minutes up a ladder, the existing energy-saving bulb was swapped out for a smart one. The Calex app lets me control the brightness and colour of the bulb, and I can save the optimal settings. Sorted!

Bulbs500

I was also fortunate to randomly find some end-of-season vlogging kits for sale! Of particular interest were the clip-on ring light, the tripod and the mountable LED lamp:

PXL 20230907 160448909500

Best part of all – £24 of kit for a tenner!

PXL 20230907 160402099500

Not bad for a beginner setup!

Mentor Check-In

I concluded August with a progress report and session demo to Olivier. It went mostly well, and Olivier had suggestions for my problem areas and optimisation tips for other slides. He also noticed the camera improvement, which is good to know!

I feel like I’m about where I expected to be, but there’s still plenty of work to do. My September jobs are:

  • Consider Olivier’s August feedback and update slides accordingly.
  • Finalise the Data Wrangler slides and practise the demo.
  • Create files to use in case of emergency or technical difficulties.
  • Optimise my streaming environment.

Summary

In this post, I talked about my August preparations for the upcoming New Stars Of Data 6 event in October 2023. Not long to go now!

If this post has been useful, please feel free to follow me on the following platforms for future updates:

Thanks for reading ~~^~~

Categories
Training & Community

New Stars Of Data 6 Preparations: July

In this post, I talk about my July preparations for the upcoming New Stars Of Data 6 event in October 2023. Yes – it’s now the middle of August. I’ve been very busy!

Table of Contents

Introduction

I’m speaking at the next New Stars Of Data event in October! I have a Sessionize profile now and everything!

2023 08 11 NewStarsOfDataSchedule

Back in December, I said that I wanted to try improving my presentation skills in 2023. I found out about New Stars Of Data when they advertised their May 2023 event, and when they opened the NSOD6 Call For Speakers I decided the time was right to have a go myself!

But wait. What’s New Stars Of Data?

Nom Sharks On What?

New Stars Of Data is an event focused on the tuition and promotion of new speakers in the Microsoft space. It is run by Ben Weissman and William Durkin, and is supported by a team of experienced speakers. There have been five events at the time of writing, with the sixth scheduled for October 27 2023.

NSOD open their Call For Speakers roughly every six months and announce it on their Twitter feed. There are four criteria that all applicants must meet:

  1. You have never spoken at a large, public event before (User Groups/Meetups do not rule you out!).
  2. Your presentation is on a topic in the Microsoft Data Platform world.
  3. Your presentation is in English.
  4. Your presentation will fill the allotted time (60 minutes).

Successful applicants have an experienced speaker assigned to them as a mentor. The mentor supports the newcomer through the process, coaching them in all aspects of creating and delivering their session.

Not sure if this is the case for every event, but I also received a cool New Stars Of Data t-shirt for taking part!

Creating My Session

In this section, I cover how I came up with my session’s topic and how I wrote my abstract.

Choosing The Topic

Once I decided to submit a session, I needed to decide what it would be about! While I specialise in AWS (amazonwebshark being a bit of a giveaway) I use many Microsoft products in a typical working week. Two of these are Power BI and Visual Studio Code. By chance, I was starting a new post when the Call For Speakers was announced. This post was about a recent VS Code extension called Data Wrangler.

Data Wrangler is a no-code data preparation and cleaning tool. It uses Python and the Pandas and Regex libraries to provide on-demand data operations, and uses Excel and Power Query technology to enable data profiling, data quality checks and the visualisation of data distributions.

Changes are presented in real-time, with a Git-like interface showing the original and updated data. This lends itself very well to demos! Data Wrangler also offers several export methods for the transformed data, one of which is CSV. This is perfect for Power BI visuals! The session was taking shape!

Finally, I needed a data source. This decision was easy, as I’ve already been using the race results from the 2023 Sale Sizzlers events for another project. The Sale Sizzlers are a series of four 5k running events that place over the Summer, the results of which are freely available as CSVs on the Nifty platform.

Having chosen a topic for the session, I needed to write an abstract for it. So what’s that?

Writing The Abstract

A session abstract is a brief summary of a session or talk. It typically provides an overview of the session topic, the services used and the key takeaways for attendees.

When writing mine, I took advice from a Brent Ozar post and a Johan Ludvig Brattås video. The result was an abstract that was technically accurate but felt a bit flat. So I turned to the tool du jour ChatGPT.

I supplied my abstract and asked ChatGPT for improvements. The results were…mixed, ranging from cliché city to word salad. Some highlights:

  • “…effortlessly assimilating the race results…”
  • “Reveling in the effortless efficiency…”
  • “…fervent sports aficionados…”
  • “Seize this opportunity to join forces in this compelling expedition of knowledge!”

None of which I could say with a straight face, so I turned those down.

I did like some of ChatGPT’s suggestions though. Ultimately, an abstract is as much an advert for a session as it is a description of one, so there needs to be some marketing and persuasiveness in there somewhere.

My finished abstract is on Sessionize and is included below:

In this session, I explore the capabilities of the Visual Studio Code Data Wrangler extension and Microsoft Power BI using real results from the Sale Sizzler 5k race series. I’ll uncover valuable insights through engaging visualisations and user-friendly and low-code data transformations.

This session will cover the following key steps:

– Getting started by setting up a Visual Studio Code environment and seamlessly importing the race results.

– Discovering the convenience of the Visual Studio Code Data Wrangler extension for effortlessly transforming and cleaning the race results

– Taking a closer look at the Python code generated by Data Wrangler to understand what’s happening behind the scenes.

– Loading the transformed race results into Power BI to generate informative visualisations and analyse trends.

Join me if you’re a data professional, a budding analyst or a sports enthusiast!

So ChatGPT wasn’t marking its own homework, I asked Google Bard what it thought of this abstract. It responded:

I think your abstract is very well-written and informative. It clearly states the topic of your session, the speaker’s qualifications, and the key takeaways for attendees.

Well, it made the robots happy. It also pleased Ben and William, as they accepted my session!

So who’s mentoring me?

My Mentor

My mentor is Olivier Van Steenlandt. Upon discovering this, I thought his name looked familiar. It turned out he’d written an Azure DevOps Pipelines post I’d seen the week prior! Small world!

Fotoshoot D1 33 681x1024 1

Olivier is a BI professional specialising in Microsoft. He has substantial experience with SQL Server, Power BI and Azure, and is currently a BI Team Lead.

Since his first session at Datagrillen 2022, Olivier has presented at several events internationally and is currently delivering a session about migrating from SSRS to Power BI Paginated Reports. I enjoy our conversations and he’s given me lots to think about!

Progress Update

In this section, I cover the specifics of my July preparations for my New Stars Of Data session.

I had my first meeting with Olivier in mid-July and got my first jobs! Jobs like “Start getting familiar with Zoomit” and “Please close some of your fifty million Chrome tabs”. I also agreed to start working on the start and end of the session and to decide on its general flow.

To begin, I consulted the New Stars Of Data Speaker Improvement Library and watched Steve Jones‘s “Creating a Slide Deck from an Idea” video. I then reached out to Steve, who happily supplied his example deck for me to review. Thanks for all your help Steve!

Next, I watched Rob Sewell‘s “How do you do that? Remote Presentations.” video. While I expected a video like Steve’s, Rob focused more on his equipment. I hadn’t even thought about this! In August I plan to test my laptop’s microphone and webcam to see what the output is like.

In my August meeting with Olivier, I demoed my starting and summary slides and showed my new-found Zoomit skills. I committed to some extra tasks besides equipment testing:

  • Finalise the start and end of the session.
  • Decide on and start producing the main section and demos.
  • Practise the session opening with a view to presenting it to Olivier in September.

Practising will work well with the equipment testing, as I’m going to have lots of disposable footage over the next few weeks…

Summary

In this post, I talked about my July preparations for the upcoming New Stars Of Data 6 event in October 2023. I’ll be posting further updates in the run-up to October, so watch this space!

If this post has been useful, please feel free to follow me on the following platforms for future updates:

Thanks for reading ~~^~~

Categories
Training & Community

T-SQL Tuesday #160: Round-Up

In this post, I write a round-up of the community responses to my March 2023 T-SQL Tuesday #160 invitation: Microsoft OpenAI Wishlist.

tsql tuesday

Table of Contents

Introduction

Earlier this month, I hosted March 2023’s T-SQL Tuesday with an invitation concerning the ongoing Microsoft and OpenAI partnership:

What is on your wishlist for the partnership between Microsoft and OpenAI?

What follows is some commentary on, and links to, each of the responses.

Chad Callihan

Chad’s post considers potential PowerShell-OpenAI functionality, which would write scripts in response to user prompts. PowerShell is a mainstay of many data professionals, enabling modules like dbatools, Pester and the AWS, Azure and GCP SDKs. An AI with access to the PowerShell Gallery would be very helpful.

Chad also points out some security concerns linked with ChatGPT use, which are good advice in general:

Chris Johnson

Chris’s post considers an AI model for file ingestion. Data pipelines frequently rely on source data with specific types and layouts. Unfortunately, source data can change between ingestion times.

At best, this breaks pipelines and causes problems and downtime for data teams. At worst, incorrect data is ingested causing potential business and customer detriment.

A no-code AI model would save hours of work if it considered previous source data and could make decisions like “This column is formatted as VARCHAR, but yesterday it was DATETIME2 and has hyphens in the right place, so I’ll CAST it as DATETIME2 today and raise a warning in the log.”

Rob Farley

Rob’s post was partially written by ChatGPT! Rob takes a pragmatic approach to AI’s progress and draws a Clippy analogy. I really want to see this AI family tree now.

ChatGPT suggests that it can help with Excel formulae, SQL Server optimization and PowerPoint visuals. It also wants to democratize technology interaction and remove traditional barriers to entry.

Steve Jones

Steve’s post imagines the next generation of AI personal assistant. One that can:

  • Learn from the user and correct common errors in all applications.
  • Suggest code optimizations in a variety of IDEs and languages.
  • Learn the user’s schedule and create automated calendar events and reminders.
  • Recognise repeat tasks and create related automation.

Summary

In this post, I wrote a round-up of the community responses to my March 2023 T-SQL Tuesday #160 invitation: Microsoft OpenAI Wishlist.

Thanks to everyone who contributed to my first T-SQL Tuesday invitation. It was great to read your responses! Anyone interested in hosting future events should contact Steve Jones.

If this post has been useful, please feel free to follow me on the following platforms for future updates:

Thanks for reading ~~^~~