Blog

  • Suit Against ChatGPT

    There is a tussle between creators and algorithms. OpenAI’s ChatGPT faces a copyright violation suit in the US. Generative AI developers are increasingly being dragged to courts of law for infringements of copyrights. As it is, old versions of AI were limited to pattern recognition, whereas generative AI is altogether different. Generative AI models ingest vast amounts of data without permission in the process of being trained. Later, its algorithms synthesize data that resembles the original text used in the training. In fact, it mimics the original creation.

    Authors Guild accuses OpenAI of systematic theft on a large scale. OpenAI is not alone to do so. There are others. It sweeps up content belonging to others without permission. The same data is used for making its algorithms. In addition, generative AI developers at times allow the third party developers to make their own apps on the top of product. The outcome is the cloned product resembling the author’s or artist’s original work, diluting its market value. News media too generate content using this method –these are vulnerable.

    India is a multi-lingual country. It has its own challenges. There should be international regulations for generative AI. In the absence of this, judiciary will come out with different one off solutions. The EU is attempting to avoid this. It expects generative AI to publish summaries of copyright materials used.

    India should be proactive here to facilitate the formulation of global regulation. India will lose a lot as a creator of original content, if it does not act proactively.

  • LLM Parameters and Learning

    Often we hear that an LLM has a billion or even trillion plus parameters. These parameters are numerical values, say weights and biases which influence the connections and activations of neurons in the model. The more the parameters, the more expressive and effective an LLM is. However, it requires more data and computational resources for training.

    OpenAI’s GPT4 has 175 billion parameters. Google’s PaLM2 has 3.6 trillion parameters.

    Weights are assigned based on the relevance and context of words. Words are converted into word embeddings, or numerical representation of words as vectors. Weights are assigned to these vectors. An LLM learns these words embeddings through massive data sources, internet, books, news articles etc.

    LLMs use attention mechanism — this mechanism computes weights for vectors by using multiple attention heads. Each attention head has its relevance criterion. This mechanism makes LLM focus on most important words both in the input and output sequences. This captures long-range dependencies and complex relationships between words.

    Weights give us the weighted average of the word embeddings. This weighted average is fed into feedforward neural network. The output could be a probability distribution over the next word in the sequence or a vector representation of the input sequence that could be used for NLP tasks.

    A backward pass in a neural network is used to adjust weights of the network so as to minimize loss function. It is also called backpropagation. In fact, it propagates error from the output layer to the input layer. Along the path, the weights are adjusted using gradient descent.

    In the training of LLM, the first step is a forward pass. Backward pass is the second step. Forward pass computes output from input data. It was the currently assigned weights. Its outcome is a prediction, which is compared to the desired target value to calculate the loss function.

    Backward pass finds the optional weights to reduce the loss function. It has to assess how much each weight contributes to the loss function. It uses a chain rule to compute derivative of a complex function by breaking it into simpler functions. The derivative indicates how a small change in a variable affects another variable.

    Backward pass moves from output to input layer. Each weight is adjusted by substracting a fraction of its derivative from its current value. This fraction is known as learning rate. It shows how fast or slow the network learns.

    Backward pass is repeated for each batch of data till network reaches a satisfactory level of performance or a maximum number of iterations.

    Vaswani et al (2017) paper Attention Is All That You Need made the researchers use Transformer model for NLP, leaving the old RNNs and CNNs. There is parallel processing here. It is a scalable model. It can capture long-term dependencies and context. It is a scalable model. It can capture long-term dependencies and context. It talked about scaled dot-product attention and multi-head attention. It injects positional information into the word embeddings to preserve the order of words in a sequence. It is called positional embedding and sinusoidal position representation. It also introduces the concept of self-attention and cross-attention. There is layer normalization and there are residual connections.

  • Consultancy Firms and AI Solutions

    Big Four accounting and audit firms in consultancy area have adopted AI whole-heartedly. EY has committed an investment of $1.4 billion to develop EY.ai which could be adopted by client orgnisations. Their AI LLM platform provides a secure environment to facilitate AI adoption by clients. PwC India has AI Lab in Gurugram. They have partnered with industry leaders such as Microsoft and OpenAI. Deloitte has unveiled a global AI market incubator in India to foster innovation. They have also tied up with Amazon Web Sevices (AWS) to drive generative AI-driven innovations for Indian businesses.

    Consultancy firms such as McKinsey, BCG and Bain and tech majors such as Accenture and IBM are investing heavily in forming partnerships with IT companies and startups to develop solutions for their clients.

    McKinsey has developed Lilli, an AI tool for consultants. They have partnered with Salesforce. The aim is to develop customised AI-driven consulting solutions.

    BCG has developed BCG X in partnership with Anthropic to provide AI models such as Claude2 to clients. They have collaborated with OpenAI to offer customised solutions.

  • FM Radio

    India has 867 commercial radio stations, including AIR, now called Aakashvani. It has phenomenal reach and is a hyper-local medium. Local retailers contribute 50-70 per cent to the radio operator’s ads.

    India’s radio business has been severely affected by the pandemic, the emergence of streaming music and a host of structural issues.

    Radio listenership and volumes of radio advertising have risen. However, its share in total advertising fell down, from 3.5 per cent to 2 per cent in 2020, and have stayed as it is.

    Advertisers do not see radio as a strategic medium for frequency or reach. Radio operators have taken to event organisation, and this contributes 20-40 percent of their top line.

    Telecom Regulatory Authority of India (TRAI) has made some recommendations to revive FM radio.

    1.There are two fees for FM operator. One time non-refundable entry fee and licence fee. The licence fee is at 4 per cent of gross revenues (including GST) or 2.5 per cent of the one-time entry fee for a city. To illustrate, if the highest bid for a city is Rs.169 crore, the licence fee would be Rs.4.2 crore.

    TRAI now recommends that licence fee should be delinked from non-refundable one-time entry fee. It should be calculated as 4 per cent of gross revenue (not including GST). If the license fees are rationalised, it will help the medium to become cost effective.

    2.The government should support those operators who have been affected by pandemic.

    3.Operators will be allowed to broadcast news and current affairs for 10 minutes per clock hour. This will bring advertisers.

    4. Mobile phones should have their receivers active for FM reception. This should be made mandatory. Telecom companies disable the receivers on handsets to promote their own music streaming service. FM radio can reach over a billion mobile users. The current listenership can easily double.

    Let us see whether the ministry accepts those proposals.

  • Make Cryptos Mainstream

    The G20 leaders were pragmatic enough and had foresight to see what a threat crypto assets posed to mainstream financial markets. The leaders were in favour of globally co-ordinated crypto regulations. They were also in favour of Reporting Framework for crypto assets, They want to have crypto exchanges for different jurisdictions as per co-ordinated timeline. They want oversight of crypto assets.

    It is now clear that none of the G20 countries is inclined to give cryptos a legal tender status.

    Cryptos, as we know, are decentralised digital assets. They have the potential of being misused. In fact, they have already been misused. However, this does not rob them of their convenience as safe haven for the investors. As we know, even regulated assets do not exhibit orderly behaviour.

    In 2022, there was a crash of an FTX exchange, and Bitcoin suffered heavy losses. Later in 2023, there was resurgence. Regulation of crypto assets is not to legitimize them, but to minimise the risks and to respond quickly in the face of the risk.

    G20 directions will accelerate the formulation of the regulatory framework. The RBI in India is in favour of banning the cryptos. It is at the instance of Supreme Court in 2020 that banking facilities have been provided to crypto exchanges. Policy makers want to provide an alternative digital currency (CBDC) here in India. That cannot rule out the co-existence of private cryptos along with the official digital currency.

    Dollar has been the reserve currency of the world. Blockchain based currency might dent the dollar’s position as the only currency to settle global payments.

    Emerging economies such as India are likely to benefit in cross-border settlements in their own currencies to counter dollar’s pre-eminence.

  • Generative AI and Coding

    Gen AI can assist the programmers to write a code. All a programmer has to do is to articulate his problem. Generative AI also makes it easier to create product presentation before clients — the workflows, demo of products through videos and more. It brings quicker approval from clients. Gen AI can help write product notes and cases.

    It is necessary to learn how to get the best from Gen AI by asking the right questions (prompts) to receive the best answers.

    AI-powered coding assistants such as Microsoft’s GitHub Co-pilot and Google’s Studio Bot are a great help to the developers. It is easy to master them, and they easily integrate into development environments(IDEs) — Visual Studio Code, JetBrains and JupyterLab which are frequently used by developers.

    The coding assistants are a great help in ML applications. They read a database table and apply algorithms such as SVM — support vector machines and gradient boosting trees to make inferences. The coding assistant finds the Python libraries and function calls. It can give a ready code to execute.

    In not using coding assistant, you will have to import data, cleanse it, build training and testing datasets, evaluate results and create charts showing results. In the process, the problem being solved loses focus. A coding assistant writes 50-60 per cent of the code for the developer. These coding assistants support Python, C Sharp, JavaScript and other languages.

    Real time coding suggestions are useful, and the developer has the freedom to accept or reject these.

    Gen AI is a blessing to both the experienced and novice programmers. Experienced programmers break a programme for any problem into a set of sub-problems, the solutions of which are already available. It is a productivity booster.

    Gen AI also converts the solution from one language to another.

    Gen AI draws solutions from open source code, and may need fine tuning.

  • AI : Both Risk and Opportunity for Journalism

    AI is both a threat and an opportunity for journalism.

    London School of Economics surveyed over 100 news organisations from 46 countries about using AI and related technologies between April and July, 2023.

    More than half of those surveyed (60 per cent) had concerns about the ethical implications of AI on journalistic values including accuracy, fairness and transparency. Journalism is undergoing metamorphosis on account of technology — the change is exciting, but at the same time scary.

    Around 85 per cent respondents experimented with generative AI such as ChatGPT or Google Bard for different tasks — writing summaries and generating headlines. Almost 60 per cent had reservations. Journalists have recognised the time saving benefits of AI with tasks such as interview transcription.

    Journalists acknowledge the need for checking AI-generated content by a human.

  • AI Tools for YouTube Creators

    Creators in fact bring the audience on line. India is the fastest growing market on account of the creator economy in India. Creators to build multi-million dollar businesses. Creators in fact increase the user base of YouTube and generate more revenue for it.

    Generation of users coming online is the key to growth. Some of these users have only their mobile phones to experience the online offerings. They connect with the communities and content on YouTube. Google engages with the creators both on a one-off basis, and on continuous basis. YouTube’s product teams have direct relationships with creators built over last so many years.

    Those creators with millions of followers and high profile influencers must be on YouTube platform. This helps YouTube to compete with TikTok and Instagram.

    YouTube wants to retain the existing pool of creators and wants to bring new creators in the eco-system. To this end, they are using generative AI. A new tool Dream Screen is being offered. It allows creators to generate videos and photos using AI to use in the background in their shorts. A creator wants to execute an idea — a dragon flying over Mumbai. They have to type a prompt, and AI generates the corresponding video.

    There are AI-generated suggestions for the future videos. There is an automatic dubbing product to translate videos in other languages. There is an AI-assisted music recommendation system. It suggests music (audio) to use for the creator’s video.

    At the root of Dream Screen are the foundation models. YouTube fine tunes some of these models so that creators can use them.

    YouTube Shorts have become competitors to TikTok. These Shorts command 70 billion average views per day. However, it is an issue — how to monetise these Shorts. YouTube is investing heavily in Shorts. This is a part of their mobile first strategy.

  • Predictive Analytics

    In this data-driven world, predictive analytics put us on firm footing. It allows us to draw valuable insights and make rational decisions. Predictive analytics uses data, statistical algorithms and ML techniques to identify the future outcomes based on historical data. The aim is to see what the future would be like based on past data.

    It is a process that uses data analysis, ML, AI and statistical models to identify patterns which predict future behaviour.

    In short, predictive analytics is a technology to make predictions about what is unknown in the future.

    There is wider interest observed in this field in the last five years. Another name for predictive analytics is advanced analytics. It has been linked to business intelligence.

    If assesses risk, business trends and future maintenance requirements. In data science, they use various regression models and ML techniques to do this.

    Predictive analytics lend a certainty to future, and this distinguishes itself from descriptive analytics.

    It is useful in demand forecasting, production planning, insurance claims, software testing life cycle.

    The following tools are well-entrenched in this digital age to do predictive analytics.

    DataRobot

    It is an automated ML platform used in predictive analysis. It enables data scientists to build predictive models. There are pre-built templates in it.

    IBM Watson Studio

    It is a comprehensive platform offering an array of tools for predictive analytics and data science. ML models construction and deployment could be done using its AutoAI feature. It has been integrated to IBM Cloud. It makes it scalable and flexible for all types of business — big, medium or small.

    SAS Analytics

    It has been around for quite some time. It keeps on evolving. It shows advanced analytics capabilities. It is useful in ML, model building and intelligence.

    Tableau

    We have already examined Tableau in detail in previous write-up. It is used for visualization. Its capacities have been expanded to cover predictive analytics. It has added features such as Explain Data and Ask Data. It makes available analytics to non-technical users. It has been integrated to cloud storage and ML libraries.

    Google Cloud AI Platform

    Google Cloud ML as well as cloud infrastructure. There is a suit of tools for Data Scientists. Model building becomes automatic. Being scalable, it is an ideal choice for organizations to leverage predictive analytics.

    Some other predictive analytics tools are IBM SPSS (Statistical Package for Social Sciences), RapidMiner Studio, TIBCO Spotfire, H2O.

    Some techniques of predictive analytics are decision trees, neural networks, text analytics, regression.

  • Poor Math Skills

    Just as movies require superstars such as Shah Rukh and Salman, the nation needs people who are good at mathematics. Those who contribute to nation’s progress are engineers and scientists. The jobs they perform involve a lot of mathematics. The challenges a country like the USA faces requires mathematics. Mathematics enables one to make calculations, analyse data and solve problems. Mathematics is becoming more and more a part of almost every career.

    Of late, many Americans joke about how bad they are at mathematics. They show poor scores on standardised math tests. The advances in technology in the next 50 years are going to come from those countries who have intellectual capital. STEM education must be supported (STEM stands for Science, Technology, Engineering and Mathematics). China and Russia has many more STEM graduates. Such countries may challenge America’s dominance.